php - Which of these three options would be fastest to populate my DB table? -



php - Which of these three options would be fastest to populate my DB table? -

i have (huge: 6.4 1000000 rows) database table 3 columns follows:

url title description

right url column populated, various urls around web. have add together title , meta description url in each row. there 3 ways see can this:

option 1: utilize php , curl go each url , grab title , meta:description. (even using options 2 or 3 i'll have @ to the lowest degree part of urls).

option 2: have dmoz xml file has several 1000000 elements this:

<externalpage about="http://animation.about.com/"> <d:title>about.com: animation guide</d:title> <d:description>keep developments in online animation skill levels. download tools, , seek inspiration online work.</d:description> <topic>top/arts/animation</topic> </externalpage> <externalpage about="http://www.toonhound.com/"> <d:title>toonhound</d:title> <d:description>british cartoon, animation , comic strip creations - links, reviews , news uk.</d:description> <topic>top/arts/animation</topic> </externalpage>

etc.

i can utilize xmlreader compare url in every externalpage-about see if matches 1 of 6.4 1000000 urls in table, , if does, add together title , description.

option 3: can write above dmoz info separate database table (again using xmlreader), take time, , somehow utilize joins , selects on 2 tables info need.

would alternative 2 or 3 quicker urls using alternative 1 urls? if yes, of 2 options faster?

assuming "faster" means less time update url table, alternative 3 seems best option.

you can populate table records xml , using bring together update main table.

php mysql curl xmlreader

Comments

Popular posts from this blog

web services - java.lang.NoClassDefFoundError: Could not initialize class net.sf.cglib.proxy.Enhancer -

Accessing MATLAB's unicode strings from C -

javascript - mongodb won't find my schema method in nested container -