
Machine Learning Lab: Binary Classification -- 2

$10-30 USD

In Progress
Posted almost 4 years ago

Paid on delivery
Hello there. I am looking for somebody with prior experience in binary classification machine learning to predict income (0 or 1). The problem should be easy for the right person, since the task must be completed in a short time. The work should be delivered as a Python Jupyter notebook.

The dataset contains NaN values in both categorical and numerical columns. Columns and rows containing only NaN values must be dropped, as must one duplicate column. After this step, NaN values should remain in only two columns: one categorical and one numerical. The missing values in these two columns must then be imputed using appropriate machine learning models (not mean, mode, etc.).

The categorical features should also be encoded in the most "intelligent" way, taking nominal and ordinal features into consideration and deciding on the best route for the dataset. Alternatively, more than one method can be used and the best one chosen at the end based on model performance. Perhaps one-hot encoding will cause high dimensionality for features with more than 4 categories? If you are an experienced data scientist, you should be able to gauge what is best here.

Feature selection should then be done, and the prepared dataset fitted to different appropriate models/algorithms, based on what would be best in this case. The dataset should be split into training, validation, and testing sets. About 4-5 different models should be applied and their output checked and validated with various metrics, with a visual to compare them.

Some text should be added to explain why decisions were made and why a certain model/algorithm was chosen. Also briefly discuss the metrics considered when evaluating model performance. Comments should be made throughout so that the user can follow the solution to the tutorial.

The original tutorial came with a rubric, but no solutions manual. Please use the rubric to guide you. Thank you.

PS: The maximum file size was exceeded when adding the rubric. I will share it once you have been selected for the task.
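To make the cleaning and imputation requirements above concrete, here is a minimal sketch of one possible approach. The real dataset was not shared, so the toy data, the column names, and the helper functions `drop_empty_and_duplicates` and `impute_with_model` are all hypothetical. Random forests are just one reasonable choice of imputation model, and leaving the `income` target out of the predictors is an assumption made here to avoid leaking the label into the features.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier, RandomForestRegressor

# Hypothetical toy data standing in for the real (unshared) dataset.
rng = np.random.default_rng(0)
n = 60
df = pd.DataFrame({
    "age": rng.integers(20, 65, n).astype(float),
    "education": rng.choice(["school", "bachelor", "master"], n),
    "hours": rng.integers(20, 60, n).astype(float),
    "occupation": rng.choice(["tech", "sales", "admin"], n),
    "income": rng.integers(0, 2, n).astype(float),
})
df["age_copy"] = df["age"]              # one duplicate column
df = df.reindex(range(n + 1))           # appends one row containing only NaN
df["empty"] = np.nan                    # one column containing only NaN
df.loc[[3, 10, 25], "hours"] = np.nan   # NaNs in a numerical column
df.loc[[5, 12, 40], "occupation"] = np.nan  # NaNs in a categorical column


def drop_empty_and_duplicates(frame):
    """Drop all-NaN columns and rows, then columns that repeat an earlier one."""
    frame = frame.dropna(axis=1, how="all").dropna(axis=0, how="all")
    keep = []
    for col in frame.columns:
        # Series.equals compares values (NaNs in the same place count as equal)
        if not any(frame[col].equals(frame[k]) for k in keep):
            keep.append(col)
    return frame[keep].reset_index(drop=True)


def impute_with_model(frame, target, predictors, model):
    """Fill NaNs in `target` with predictions from a model trained on known rows."""
    known = frame[target].notna()
    # One-hot encode predictors only so the model can consume them
    X = pd.get_dummies(frame[predictors]).astype(float)
    model.fit(X[known], frame.loc[known, target])
    out = frame.copy()
    if (~known).any():
        out.loc[~known, target] = model.predict(X[~known])
    return out


clean = drop_empty_and_duplicates(df)

# Predictors: fully observed columns, excluding the label (assumption, see above)
complete = [c for c in clean.columns
            if c != "income" and clean[c].notna().all()]

imputed = impute_with_model(
    clean, "hours", complete,
    RandomForestRegressor(n_estimators=50, random_state=0))
imputed = impute_with_model(
    imputed, "occupation", complete,
    RandomForestClassifier(n_estimators=50, random_state=0))

print(clean.isna().sum())    # NaNs remain only in "hours" and "occupation"
print(imputed.isna().sum())  # no NaNs remain after imputation
```

In a real notebook the imputation models would ideally be fitted on the training split only and then applied to the validation and test splits, so that no information from held-out rows influences the filled-in values.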
Project ID: 26807240

About the project

4 proposals
Remote project
Active 4 yrs ago

Awarded to:
Hello, I will deliver the project according to your requirements. I hope to get the chance, and thank you.
$25 USD in 3 days
0.0 (0 reviews)
0.0
0.0
4 freelancers are bidding on average $38 USD for this job
Hi! Hope you are doing great. I can help you with this project. Please send me a message to discuss more. I am available to start working on the project immediately.
$80 USD in 7 days
5.0 (9 reviews)
4.0
4.0
Hey, I am a professional data scientist with 5 years of experience. I hold an MBA and a first degree in statistics, which give me the necessary background to handle your project. Having done various projects using SPSS, R, and Python, I can deliver quality work at a price we are both comfortable with and within the agreed timeline. Kindly send me a message.
$30 USD in 3 days
4.9 (5 reviews)
3.7
3.7
I am a specialist in this area and have worked on multiple digital transformation and artificial intelligence projects.
$18 USD in 7 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of SOUTH AFRICA
Cape Town, South Africa
0.0
0
Member since Aug 2, 2020

Client Verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)