Hello
Bernhard Zwischenbrugger wrote:
Hi
Some questions
As part of my research under my professor I have to implement a web
interface to their benchmarking data.
PHP is the chosen web language but we are little worried about the
database. The benchmark data comes to us in XML format (e.g.
http://www.matf.bg.ac.yu/~filip/ArgoLib/smt-lib-xml/Examples/FolEq1.xml).
We have to implement an interface to query them, get data, update etc.
You can parse the XML, extract the data, put it to an SQL DB and move
the XML to /dev/null (delete it).
If you do that, you don't need an XML DB.
Is this possible?
No. My professor is dead against that. Many people have suggested me
doing that. Why? Are XML databases incorrectly implemented or are bad?
We even can change schema in the form of attributes. . The data size
would be around 100 MB each XML with around 100 different XMLs.
What do you mean by "different XMLs"?
Are you looking for a maschine that makes SQL Tables from XML?
What is inside of the 100MB XML? Your example is a MathML Formula.
By different XMLs I meants, different XML files. So we can have 40 XML
files with around 50-100 MB each.
Yes, they will have lot sof those MathML formulas. Its benchmarking data
from some theoritical group that my professor works with. They have all
their database in XML so relational database is not possible otherwise
we will have to convert them between XML and relational all the time.
The load would be max 5-10 users any given time, batch updates once a
month and heavy load probably 2-3 times a month. Mission criticality is
not important, we can get it down sometimes. Which db would you suggest?
I did Google research and as of now - I like eXist, Sedna (they seem to
have good PHP wrapper support) and Timber. Another thing would be good
documentation and support.
With an XML DB you can query data using XPATH. Is that the thing you
want? Oracle supports that for example.
Yeah but looking at
http://www.oracle.com/technology/tech/xml/xmldb/index.html, I could not
find whether its free. I might be wrong but the info is not easily found
there.
One the contrary IBMs offering at
http://www-306.ibm.com/software/data/db2/express/download.html looks FREE.
The problem is that both the above two database are beasts in themselves
and I just require 10% of what they do :)
We looked into Berkeley DB also but their support for PHP is not that
great. We have compile the module by ourselves etc (this is not a
problem though as we have the technical know how to do that) but lot of
people have suggested that its not a very stable system. I have not done
any benchmarking on it but I dont want to change my underlying DB couple
of months down the line just because we found out its not stable.
Apart from the above big three, the other free and reasonable good
implementation seems to be eXist, Timber and Sedna. This I am just
saying by reading their website. I have not used them.
So my question is: out of the six systems listed above, which one you
would suggest? Has anybody used any of the above system before? What are
your experiences?
Bernhard
Ritesh
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php