Find Jobs
Hire Freelancers

Data Mining - Scrape a database from a website

$30-250 USD

In Progress
Posted over 15 years ago

$30-250 USD

Paid on delivery
I need you to make a script that will run either [login to view URL] or [login to view URL] and populate a mysql database (the reason that this needs to be a script is there are probably 700,000+ entries in total). I prefer to the script to be in PHP, but if you have other, faster methods, then you can use those as well. The scraped data should go into 2 tables in mysql. 1st table: artists fields: a_name (name of the artist, i.e. "Britney Spears"), a_id (incremental artist ID 1 by 1 starting from 1) a_alias_plain (url field - it'll be structure "artist-name" the multiple words are separated by dashes. All words are lower case. All non-numeric/non-alphabet characters must be parsed out. Make sure there is only 1 dash separating each word) a_alias_lyrics (url field - it'll be the structure "artist-name-lyrics", mutliple words are separated by dashes and "-lyrics" is appended at the end. All words are lower case. All non-numeric/non-alphabet characters must be parsed out. Make sure there is only 1 dash separating each word) 2nd table: songs fields: s_id (id of the song, incremental 1 by 1 starting with 1) s_name (the name of the song, i.e. "Feel The Way") s_text (the actual text of the song, I only want the text and not any other stuff on the page) s_artist (this is going to be the Artist's ID from a_id - this is so that I can associate which song is for which artist) s_alias_plain (this is an url field - structure is "song-name", each word is separated by dashes. All words are lower case. All non-numeric/non-alphabet characters must be parsed out. Make sure there is only 1 dash separating each word) s_alias_lyrics (this is the 2nd url field just in case, each word is separated by dashes with "-lyrics" appended at the end. All words are lower case. All non-numeric/non-alphabet characters must be parsed out. Make sure there is only 1 dash separating each word) Database should have proper collation so that all special characters are displayed. The whole database should probably have 700,000+ entries. I don't want to wait more than 5 days, so if you can complete it within that time frame, feel free to bid. I am not paying more than $100 so please don't bid higher. I need to start as soon as possible, so if you give me a good bid, you could even start working today. Please only bid if you have read the requirements fully.
Project ID: 323328

About the project

15 proposals
Remote project
Active 16 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
15 freelancers are bidding on average $154 USD for this job
User Avatar
Please check PMB.
$100 USD in 5 days
4.8 (289 reviews)
8.2
8.2
User Avatar
Highly excited to do this job. Please check PMB.
$100 USD in 2 days
5.0 (91 reviews)
7.6
7.6
User Avatar
I am 5 year experienced Linux based programmer. See PM for Details
$80 USD in 2 days
4.8 (157 reviews)
7.1
7.1
User Avatar
Hi, Kindly have a look at PM, Thanks.
$100 USD in 0 day
4.9 (103 reviews)
6.0
6.0
User Avatar
Hi,please check PM.
$250 USD in 2 days
5.0 (7 reviews)
4.9
4.9
User Avatar
I can help with that
$100 USD in 4 days
4.9 (22 reviews)
4.6
4.6
User Avatar
We can do this for you. The task is interesting. Please let us know to which site you wanna give priority to fetch data from the given two. The task is not critical at all. but it should get completed with proper care as we have to deal with html formates to fetch data. Pelase check your PM, We are ready to start with :) and just waiting for you to select us. We will definetly deliver you the expected output with no compromise. Regards
$230 USD in 5 days
5.0 (7 reviews)
4.2
4.2
User Avatar
Please check your PM.
$100 USD in 4 days
5.0 (1 review)
3.2
3.2
User Avatar
Hello, This is a placeholder bid - please see pm for details. regards, Satsco.
$100 USD in 5 days
5.0 (2 reviews)
1.7
1.7
User Avatar
We will deliver you what you need in the specified time with 100% surety of Clear Data . Waiting for your PM Thanks
$245 USD in 10 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I can do that with another method fast method. Contact me for details
$200 USD in 1 day
0.0 (0 reviews)
0.0
0.0
User Avatar
Hello I can realize it on Perl in 2 days max
$50 USD in 2 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hi! I have Kapow Mashup Server to do it faster. But i need win or linux server to run bot.
$250 USD in 7 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
Brooklyn, United States
5.0
381
Payment method verified
Member since Aug 25, 2008

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.