I need a simple web spider and data scraper for the following web site:
<[login to view URL]>
Please read detailed description below...
## Deliverables
I need a simple web spider and data scraper for the following web site:
<[login to view URL]>
It should also work for the sister sites:
<[login to view URL]>
<[login to view URL]>
<[login to view URL]>
<[login to view URL]>
The purpose of the crawler is to find STREAM addresses by digging into the playlists that are presented in the station list as hyperlinks.
For example, <[login to view URL]>
Radio 1 has several streams listed. Their URLs point to:
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
The first link returns a Windows Media Player playlist, and the other three are Winamp playlists. The playlists should be read, and the stream URLs parsed from them and reported.
Output should be an XML document with a table containing the following data:
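To illustrate the playlist step, here is a minimal Python sketch. It assumes the Windows Media Player playlists are ASX files and the Winamp playlists are PLS or M3U files; the actual formats on the site have not been checked, and the function name `extract_stream_urls` is only illustrative.

```python
import re

# ASX (assumed Windows Media Player format): <ref href="..."/> entries.
ASX_REF = re.compile(r'<ref\s+[^>]*href\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)
# PLS (assumed Winamp format): lines such as File1=http://host:port/
PLS_FILE = re.compile(r'^\s*File\d+\s*=\s*(\S+)\s*$', re.IGNORECASE | re.MULTILINE)


def extract_stream_urls(playlist_text):
    """Return the stream URLs found in an ASX, PLS, or M3U playlist body."""
    text = playlist_text.strip()
    lowered = text.lower()
    if "<asx" in lowered:
        return ASX_REF.findall(text)
    if lowered.startswith("[playlist]"):
        return PLS_FILE.findall(text)
    # Fallback: treat the text as M3U, where every non-comment line is a URL.
    urls = []
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            urls.append(line)
    return urls
```

The caller would download each playlist link and pass the response body to this function; the detection is done on the content rather than the file extension, since the links may not carry one.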
- Station Name
- Station Web Site
- Station Stream Type (look at icons below)
- Station Area (Europe, Canada, US, New Zealand, Australia)
- Country/Area
- Location
- Stream URL
![][1] WMA ripper
![][2] shoutcast ripper
![][3] OGG ripper
![][4] also shoutcast ripper
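As a rough idea of the output, the sketch below builds the XML with Python's standard `xml.etree.ElementTree`. The element names (`stations`, `station`, `name`, `website`, and so on) are placeholders chosen for this example; the final schema is open to discussion.

```python
import xml.etree.ElementTree as ET

# Hypothetical element names, one per requested field.
FIELDS = ["name", "website", "stream_type", "area", "country", "location", "stream_url"]


def build_stations_xml(stations):
    """Serialize a list of station dicts into one XML string."""
    root = ET.Element("stations")
    for station in stations:
        node = ET.SubElement(root, "station")
        for field in FIELDS:
            # Missing fields become empty elements so every row has the same shape.
            ET.SubElement(node, field).text = station.get(field, "")
    return ET.tostring(root, encoding="unicode")
```

One `station` element per stream URL keeps the table flat, so a station with several streams would simply appear in several rows.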
Since the web site is small, the spider should work from memory.
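One reading of "work from memory" is that the spider keeps its queue and the fetched pages in ordinary in-memory structures, with no database or on-disk cache. A minimal sketch under that assumption follows; `crawl` and `LinkCollector` are illustrative names, and `fetch` is a caller-supplied function that returns the HTML for a URL.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch):
    """Breadth-first crawl of one host; returns {url: html}, all held in memory."""
    host = urlparse(start_url).netloc
    pages = {}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        if url in pages:
            continue  # already visited
        html = fetch(url)
        pages[url] = html
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.links:
            absolute = urljoin(url, href)
            # Stay on the starting host so the crawl does not leave the site.
            if urlparse(absolute).netloc == host and absolute not in pages:
                queue.append(absolute)
    return pages
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP library and makes it easy to try against canned pages before pointing it at the real site.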