I need to be able to process several files on a website and retrieve certain fields and capture the data onto a MS Access database.
To illustrate, in the URL below:
<[login to view URL];view=commentary>
I need to be able to capture over number, bowler (the words after the over number and before 'to'), the batsman (the words after to and before the comma), and the runs (number or text after the comma). In a few instances, there could be other text - like FOUR (which needs to be captured as 4 in the database), or 1 no ball, which needs to be captured as 0 in the runs column and marked as 1 and NB (in two fieds - extra_runs and extra_type) or OUT (which needs to be flagged in a separate column dismissal with 'y' flag).
Along with the values mentioned above, there will be few other details which will be driven out of a config file - these include match number, innings number, match venue, bat team and bowl team.
Further, details about the URL and start ball number (0.1, etc) and end ball number will be made available in the same config file.
## Deliverables
Processing needs to happen quickly.
Critically, the application should allow for processing of the data in batches. So, in one go, I should be able to process upto ten different links.
Another critical thing, development needs to happen within 48 hours - I will test out the application entirely in the next 48 hours and report back with bugs which need to be fixed in the next 48 hours.
Only coders who can meet these timelines please apply.