Hello and thank you for taking the time to review this opportunity. Let me preface with saying I have already accomplished the goal of converting this data and will include an example of what I did. The problem is I need to have a solution that works much FASTER parsing/concatenating the data I am capturing.
This is how I envision your solutions to function:
1. Detect new file creation
a. I am capturing data and can export this data every 20-60 seconds into a local directory we agree on.
i. I can export this data in the following format of your choice (CSV is my preference)
1. CSV
2. HTML
3. SQL insert
ii. SQL drop/create
iii. XLS
2. Using excel services/access services (of whatever you find best) I need to detect when a new file has been created and this should be the trigger to this data conversion workflow
a. For the sake of this example we will use “C:\dataconvert\exports\scrapers” as our trigger folder
b. Once the new file has been written to disk (this file can take 10-15 seconds to write so import trigger event must not run into filelocking issues, allow the export to complete before running this data convert
3. Convert data into access dynamically merging into Table
a. I would like to discuss briefly on best strategy to this (2-5million parts)
Also, I would like to create a master lookup table of categories that will allow me to find the “closest” match to a lookup. “NOTE: Vlookup/match/offset/ect are not quite adequate, I need a proper VB script with a little logic to it.
I have spreadsheets from a Web Extraction I built and need to convert it from its current table format to xml.
Since i have this data in excel 2010 (i will save a backwards compatible file for your download), i was thinking this would be best completed using a VB macro in excel. I am open to other solutions if it’s easy for you to develop.
I expect decent documentation in the code to allow me to understand the functionality (i am savvy so basic explanation will suffice)
The spreadsheet I created is DATA INTENSIVE and YOUR CODE MUST BE CLEAN AND MOST IMPORTANT IT MUST BE FAST FAST FAST AND DOING THIS. Please focus on making the code run fast.
This will be run on a standalone I7 950 on a Sabertooth board with 24gb of PC3 memory.
Please don’t hesitate to ask me any questions. I have included sample data which I already converted…(I included that as well)
Please feel free to use any type of solution you wish, but it must start as CSV and end up merged into an ACCESS 2007/2010 database.
If we work well together I would also like to inquire about remote teaching. I have development servers I can give you remote access to and we can either chat via phone or VIOP and you can tutor me on a few roadblocks I’m finding in Access, excel and if you have experience regular expressions/Java as well as Sharepoint 2010 server and CRM 2011. I’m integrating a substantial amount of data into them.
I am the owner and founder of my company and keeping my eyes and ears open. Hopefully I find someone for years of solid freelancing. If you’re looking for a fast growing challenging and outside the box opportunity, you should also contact me.