JimWorld Forums: data conversion tool recomendation, please



Posted By: leicaphotos ()
Posted On: 05/14/2008 05:36 am

I have roughly 2 gig of short html files (40/50 lines of html)
and am looking for a utility to do the following:

1) Strip out the html or rewrite it to my specs.

2) Drop the file into a database as a record... not real worried about which format -- final product will end up sql- -- if I can get it to any data driven format, I can convert fairly easily...

3) everything is currently indexed and searched by index server (not my choice) which presents an ugly presentation due to old design...


any software recomendations to speed up this process? I feel like I've been given the task of counting the grains of sand on the beach

Thanks
Blake



Posted By: Prowler (Staff)
Posted On: 05/16/2008 07:45 am

There may not be a readily available utility which will do the job. But you can write a Perl script which will fetch each file from a given directory (where you dump all the 2 gig html files if your OS will permit) and write out new files after stripping out the HTML tags.



Posted By: leicaphotos ()
Posted On: 05/19/2008 06:38 am

reading the files from the directory does no good --- it still leaves the problem of a good search --- I've been given the task of categorizing these and the business needs dictate a database as I'm going to be adding fields -- it's 2 gig of recipes


Posted By: Dinkar (Staff)
Posted On: 05/19/2008 07:47 pm

Visit elance.com or getafreelancer.com to find someone to do this job.



JimWorld Forums © 1996 - 2004 .... iWeb Technology, Jimworld.com