Crawl an entire website and convert to PDF

$100-500 USD

Cancelled
Posted about 13 years ago

Paid on delivery
Create a webform that accepts two required values (e-mail address and website URL), two optional values that must be entered together (username and password), and an 'execute' process button.

For example, the e-mail address provided is <john@[login to view URL]> and the URL is <[login to view URL]>. Username: Jeffery_Fac7s Password: H%bYq@)

All e-mail and URL values would have basic validation checking. The username and password fields should accept special characters. After entering the values, the user clicks the 'execute' button.

Also create a web API that can accept the same four values.

Processing:
- Store the e-mail address.
- Crawl the website URL (<[login to view URL]>) with no depth limit within the domain ([login to view URL]).
- The crawler must also be able to enter password-protected areas with the supplied username/password credentials, either prompted by textbox or embedded in the URL (<http://username:password@[login to view URL]>).
- Convert HTML, images, CSS, and script output (php/xml) into PDF. In other words, generate a 'snapshot' of what a browser would display as a PDF.
- Combine all these PDF pages into a single PDF document.
- Name the file <[login to view URL]>-<mm-dd-yyyy>-<24hr:min:sec>.pdf
- Upload (FTP) the PDF document onto the supplied web server.
- If the work order was entered through the webform:
  - Generate a retrieval URL.
  - Send the retrieval URL to the stored e-mail address (<john@[login to view URL]>) originally provided in step 1, with a unique transaction number in the subject line and body.
- If the work order was entered through the API:
  - Return the PDF payload over the open HTTP connection.
  - In case of timeout, fall back to the e-mail delivery described above.

Support: We can provide server support, a Windows or Linux (CentOS) OS platform, and software tools. A Linux implementation is preferred.

## Deliverables

Suggestions: You may want to first create a complete URL list of the website.
- There is no limit to the depth of links, but any links that don't match the primary domain ([login to view URL]) will be omitted.
With the URL list, open each URL in turn and then execute the conversion.
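The URL-list suggestion above could be sketched roughly as follows. This is only an illustration under assumptions, not part of the original brief: `build_url_list` and its `fetch` parameter are hypothetical names, `fetch` stands in for whatever HTTP client (with credentials) is actually used, and "matches the primary domain" is interpreted here as an exact hostname match, so subdomains are skipped.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit


class LinkParser(HTMLParser):
    """Collects href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def build_url_list(start_url, fetch):
    """Breadth-first crawl; returns same-host URLs in discovery order.

    `fetch` is a hypothetical callable: url -> HTML text (or None on failure).
    There is no depth limit; the `seen` set prevents revisiting pages.
    """
    host = urlsplit(start_url).hostname
    start = urldefrag(start_url).url
    seen = {start}
    order = [start]
    queue = deque([start])
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if not html:
            continue
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop #fragments.
            absolute = urldefrag(urljoin(url, href)).url
            parts = urlsplit(absolute)
            if parts.scheme not in ("http", "https"):
                continue  # skip mailto:, javascript:, etc.
            if parts.hostname != host:
                continue  # omit links outside the primary domain
            if absolute not in seen:
                seen.add(absolute)
                order.append(absolute)
                queue.append(absolute)
    return order
```

Passing `fetch` in as a parameter keeps the link-following logic separate from authentication and network handling, so the list builder can be exercised against an in-memory dictionary of pages before it is pointed at a real site.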
There are some websites that may help:
URL List: <[login to view URL]>
HTML to PDF conversion: <[login to view URL]> <[login to view URL]>
Adobe Acrobat 9+ also provides a mechanism to convert websites into PDF files.
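The input rules and the file-naming pattern in the brief could be checked with something like the sketch below. The function names, the loose e-mail pattern, and the `site_label` parameter are assumptions made for illustration; the brief only asks for "basic" validation, and the exact text used for the first part of the file name is not visible in the posting.

```python
import re
from datetime import datetime
from urllib.parse import urlsplit

# Deliberately loose pattern: basic validation only, not full RFC 5322.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate_order(email, url, username="", password=""):
    """Return a list of error messages; an empty list means the input is acceptable."""
    errors = []
    if not EMAIL_RE.match(email or ""):
        errors.append("invalid e-mail address")
    try:
        parts = urlsplit(url or "")
        url_ok = parts.scheme in ("http", "https") and bool(parts.hostname)
    except ValueError:
        url_ok = False
    if not url_ok:
        errors.append("invalid website URL")
    # Optional credentials must be supplied together or not at all.
    # No character filtering is applied, so special characters are accepted.
    if bool(username) != bool(password):
        errors.append("username and password must be entered together")
    return errors


def pdf_filename(site_label, when):
    """Build '<site_label>-<mm-dd-yyyy>-<HH:MM:SS>.pdf' using a 24-hour clock."""
    return f"{site_label}-{when.strftime('%m-%d-%Y')}-{when.strftime('%H:%M:%S')}.pdf"
```

For instance, `pdf_filename("example.com", datetime(2011, 2, 21, 14, 5, 9))` yields `example.com-02-21-2011-14:05:09.pdf`. Colons are not allowed in Windows file names, so the time separator would need to change if the Windows platform option were chosen.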
Project ID: 3082667

About the project

4 proposals
Remote project
Active 13 yrs ago

4 freelancers are bidding on average $438 USD for this job
$170 USD in 17 days
4.8 (15 reviews)
4.0
4.0
$391 USD in 17 days
0.0 (1 review)
0.0
0.0
$467.50 USD in 17 days
0.0 (3 reviews)
0.0
0.0
$510 USD in 17 days
0.0 (0 reviews)
0.0
0.0
$382.50 USD in 17 days
0.0 (2 reviews)
0.3
0.3

About the client

Flag of UNITED STATES
San Jose, United States
0.0
0
Member since Feb 21, 2011

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)