Delphi HTML parser

Closed Posted May 25, 2010 Paid on delivery

The goal of this project is to build an "intelligent" HTML parser that extracts data from HTML pages.

This parser should be able to automatically extract data such as:

companyName, address, email, fax, tel, website

The parser must be able to extract this set of fields N times per page, since the HTML pages will contain tabular data (N records per page).

[url removed, login to view]();

while ([url removed, login to view]()) do begin;

data:=[url removed, login to view]();

// data should be an object or type like

// [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view]

end;
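The method names in the loop above were stripped from the posting, so the following is only a sketch of the shape such an interface might take. It is written in Python for brevity; the deliverable itself would have to be Delphi 6. The names `Contact`, `ContactParser`, `load`, `has_next` and `next` are hypothetical, and `load` takes ready-made records instead of HTML so that the example stays self-contained.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Contact:
    """One extracted record; field names follow the list in the posting."""
    company_name: Optional[str] = None
    address: Optional[str] = None
    email: Optional[str] = None
    fax: Optional[str] = None
    tel: Optional[str] = None
    website: Optional[str] = None


class ContactParser:
    """Hypothetical load / has-next / get-next interface (names are assumptions)."""

    def __init__(self) -> None:
        self._records: List[Contact] = []
        self._pos = 0

    def load(self, records: List[Contact]) -> None:
        # A real parser would accept HTML here and fill the list itself;
        # this stub takes already-built records to keep the sketch small.
        self._records = list(records)
        self._pos = 0

    def has_next(self) -> bool:
        return self._pos < len(self._records)

    def next(self) -> Contact:
        record = self._records[self._pos]
        self._pos += 1
        return record
```

A caller would then loop with `while parser.has_next(): data = parser.next()`, reading `data.company_name`, `data.email` and so on, which mirrors the `while ... do begin ... end;` loop in the posting.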

I think a good knowledge of the DOM and of regular expressions (regex) is necessary.
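To illustrate the regex side, here is a minimal Python sketch that pulls email, website, tel and fax out of the plain text of one block. The patterns and the name `extract_fields` are assumptions for illustration, not part of the posting. They are deliberately loose and will produce false positives on real pages (for example, "tel" inside a longer word).

```python
import re
from typing import Dict, Optional

# Illustrative patterns only; real-world data needs more careful rules.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
WEBSITE_RE = re.compile(r"(?:https?://|www\.)[^\s<>\"']+", re.IGNORECASE)
# A phone-like number: optional "+", then digits mixed with spaces, dots,
# dashes and parentheses, ending in a digit.
NUMBER = r"(\+?\d[\d\s().-]{5,}\d)"
TEL_RE = re.compile(r"(?:tel|phone)\.?\s*:?\s*" + NUMBER, re.IGNORECASE)
FAX_RE = re.compile(r"fax\.?\s*:?\s*" + NUMBER, re.IGNORECASE)


def extract_fields(text: str) -> Dict[str, Optional[str]]:
    """Return the first match for each field, or None when nothing matches."""

    def first(pattern: "re.Pattern[str]", group: int = 0) -> Optional[str]:
        match = pattern.search(text)
        return match.group(group).strip() if match else None

    return {
        "email": first(EMAIL_RE),
        "website": first(WEBSITE_RE),
        "tel": first(TEL_RE, 1),
        "fax": first(FAX_RE, 1),
    }
```

Company name and address have no reliable textual pattern, so they would more likely come from DOM position (for example, the first heading or bold element in a block) than from a regex.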

Of course it will not work on ALL websites, but it should be universal enough.

It should work with data from:

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

[url removed, login to view]

etc.

I think a good strategy would be:

1) Find a repetitive fragment in the DOM (when a page contains 20 results, it should extract 20 HTML blocks).

2) Apply a parser to each block that contains data to be extracted.
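Step 1 can be sketched as follows, in Python and using only the standard library. This is one naive heuristic under stated assumptions, not a definitive method: sibling elements count as "the same kind of block" when they share a tag and `class` attribute, only siblings that contain child elements are considered, and the largest such group on the page wins. The names `Node`, `TreeBuilder` and `find_repeated_blocks` are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser
from typing import List

VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class Node:
    def __init__(self, tag, cls, parent):
        self.tag, self.cls, self.parent = tag, cls, parent
        self.items = []  # child Nodes and text strings, in document order

    def children(self):
        return [i for i in self.items if isinstance(i, Node)]

    def signature(self):
        return (self.tag, self.cls)

    def text(self):
        return " ".join(i.text() if isinstance(i, Node) else i for i in self.items)


class TreeBuilder(HTMLParser):
    """Builds a very small DOM-like tree; tolerant of stray end tags."""

    def __init__(self):
        super().__init__()
        self.root = Node("#root", "", None)
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, dict(attrs).get("class") or "", self.current)
        self.current.items.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this tag, if any.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            self.current.items.append(data.strip())


def find_repeated_blocks(html: str, min_repeats: int = 2) -> List[str]:
    """Return the text of each block in the largest group of similar siblings."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    best = []
    stack = [builder.root]
    while stack:
        node = stack.pop()
        kids = node.children()
        stack.extend(kids)
        # Ignore leaf elements so table cells or bare list items do not win.
        structured = [k for k in kids if k.children()]
        if not structured:
            continue
        sig, count = Counter(k.signature() for k in structured).most_common(1)[0]
        if count >= min_repeats and count > len(best):
            best = [k for k in structured if k.signature() == sig]
    return [block.text() for block in best]
```

For step 2, each returned string would then be handed to a per-block field extractor. On real pages the heuristic can pick the wrong group (navigation menus often repeat too), so a production version would probably also score candidate groups by whether their blocks contain email- or phone-like text.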

The parser must be Delphi 6 compatible.

Skills: Delphi, Engineering, Microsoft, Project Management, Software Architecture, Software Testing, Windows Desktop

Project ID: #3451768

About the project

11 proposals Remote project Active Jun 16, 2010

11 freelancers are bidding on average $425 for this job

IWSolutions: $425 USD in 14 days (101 reviews, rating 6.7)
kraneware: $425 USD in 14 days (8 reviews, rating 5.9)
PaulFarr: $425 USD in 14 days (33 reviews, rating 4.9)
vw7437936vw: $425 USD in 14 days (19 reviews, rating 4.1)
powzak: $425 USD in 14 days (28 reviews, rating 4.1)
devdlrb: $425 USD in 14 days (1 review, rating 0.7)
myimservices: $425 USD in 14 days (0 reviews, rating 0.0)
heidelguest: $425 USD in 14 days (1 review, rating 0.0)
secureenix: $425 USD in 14 days (0 reviews, rating 0.0)
abeloqp: $425 USD in 14 days (0 reviews, rating 0.0)
bluesoftcoders: $425 USD in 14 days (3 reviews, rating 2.2)