Find Jobs
Hire Freelancers

Apache Nutch Expert needed to diagnose the problem with partial crawling.

$10-30 USD

Closed
Posted over 9 years ago

$10-30 USD

Paid on delivery
I have a Apache Nutch 1.7 application running on Hadoop YARN 2.3.0 , I am facing two problems. 1. I have 10 urls(domains) in my seed list Nutch only cralws 5 of them. 2. The 5 domains that are being crawled in step #1 are being crawled only partially , meaning about only 5 to 10 % of the possible pages are being crawled. I believe this is a problem , because of some configuration issues , I have plaed with different values of depth and topN while starting the crawl , but still faced the same issue, I need someone to help me point out the problem.
Project ID: 6355843

About the project

2 proposals
Remote project
Active 9 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
2 freelancers are bidding on average $98 USD for this job
User Avatar
A proposal has not yet been provided
$105 USD in 3 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Dear Sir, I'm interesting in your job. I have much experience in developing the website and maintaining web server.. I can do this job. Best regards.
$90 USD in 3 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
Plano, United States
5.0
3
Payment method verified
Member since Dec 1, 2002

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.