Develop Python script to handle large XML file using SAX parser and load to MySQL tables
$30-250 USD
Completed
Posted about 7 years ago
$30-250 USD
Paid on delivery
We currently have a multi-step process which loads XML data into a MySQL database on a weekly basis. A file size increase of the incoming XML files necessitates a switch to a SAX parser from the current DOM parser.
Current state:
- Existing Python script uses [login to view URL] DOM and loops through 15 weekly XML files, performing a minor transformation to each using an XSLT file, and writes transformed XML to temporary .xml files
- The temporary .xml files are loaded to seven MySQL tables using LOAD XML LOCAL INFILE
Requirements:
- Develop a Python script which replicates the above steps into a single script that uses a SAX parser which can handle XML files with size of at least 2GB
- The script should parse through each of the XML files and load the data to the target AWS MySQL tables
- Script should unescape html elements (&) which exsit in XML prior to writing to database tables (see: [login to view URL])
- Script must process files in batches (7, 7, 1) so that DB scripts can be executed between each batch
- Code should be fully commented
- If applicable, please include instructions for installation of any required modules, etc.
I will provide the following:
- sample XML file
- existing Python script
- XSLT file currently used to transform XML
- SQL which is currently used to load the MySQL tables
Please contact me for more information.
I'm expert in data processing with hundreds of excellent reviews here that's why I'm sure you'll be impressed with my work.
I can create the script you want but I need to clarify one thing: are you talking about writing to MySQL directly from the script or do you still want to generate temporary XML files and load them later into MySQL?
Because unescaping HTML in temporary XML files is not a good idea...
Thanks.
Roman
Hello!
We are a professional team of web developers with huge experience in using python for custom webapps based on django and odoo.
Please provide more details about your project.
We are available and will be happy to help you with the project. Looking forward for further discussion.
Best Regards,
Sergii Savchenko
CEO @ PineDev Studio
Hi there, I'm highly experienced in python/web services.
I can build a more optimized xml parsing script for large files, within about a week.
The exact solution will depend on how the existing service is deployed and available server resources (ram/cpu/threads/os) and processing time constraints.
Please contact me if you would like to discuss the project further.
Hi,
I have worked with python, and with large xml and csv files (upto 10GB in size) for a SAAS service that I was working on.
I am sure I can get python to handle your xml files using SAX parser and get it imported into mysql
let me know if you have any questions.
thanks
kaleshwar.
Hi there,
I am an experienced developer and I can do this for you. Also to investigate a bit more about more efficient ways of parsing data.
Give me the details you were talking about to have a look.
Lopking forward working with you,
Ioan
Computer Engineer. More than 20 years of experience.
Expert in, but not limited to, GNU/Linux, BSD, Computer Security.
Will lead your project to success.
SALUTATIONS
How are you?
I hope you are doing well and the season is going great for you and your business.
I and my team have gone through your requirements for this project. And one thing I can assure you that they will all be fulfilled. My team and me will keep you in the loop at every stage of the project and will ensure to add improvements to project and deliver to you the best possible.
Though my team is new but they are all very efficient and well trained in Python, MySQL, Java and its technologies. I myself have an experience of 4 years, I also handle clients for us 24*7. But in our limited span of experience we have worked on many diverse projects and have always tried to deliver our best. So far we have had no complaints and we strive hard to maintain that.
We would also like to discuss the specifics of the project like the technical discussion and the viewing of the current code. The budget and the time required for completion will be decided at the end. And we ensure you that we will stick to the budget and the project timeline.
My team and I are capable to handle such projects and more than that we are all hard workers and always give our 100 percent to the job at hand.
To conclude, we would like to hear from you as soon as possible regarding the specifics and details of the project.
Below are some of our game projects:
King and Crown
Roulette
Joker Bonus
Fever Joker Bonus
Looking forward to long work relationship with you!!
With Thanks & Regards
Vijay
Below will be my plan:
1. Understand existing script and working as a first milestone.
2. Start working on new requirement and create a basic working prototype.
3. Corner cases and exception handling as a next milestone.
4. Load testing and bug fixes will be the final milestone.
Looking forward to work with you.