Data Mining Using C++

Please read the project and see the attachments

1 Introduction

This project requires you to explore classification algorithms on a real-world dataset and write a report explaining your experimental results. The language of implementation is to be C++. The other requirements are that your program be able to interpret the data format specified below, and be able to classify instances and produce interesting statistics such as accuracy, false positive rate, false negative rate, etc. You are free to construct whatever user interface you like for your program, but you must fully document your interface.
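
The statistics named above can all be derived from a binary confusion matrix. The following is a minimal sketch, not a required design: the names `Confusion`, `tally`, `accuracy`, `falsePositiveRate` and `falseNegativeRate` are hypothetical, and it assumes the "positive" class is the one you care about detecting.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Counts for a binary confusion matrix. Which class counts as "positive"
// is an assumption left to the caller.
struct Confusion {
    std::size_t tp = 0, fp = 0, tn = 0, fn = 0;
};

// Tally predictions against true labels (true = positive class).
Confusion tally(const std::vector<bool>& truth, const std::vector<bool>& pred) {
    assert(truth.size() == pred.size());
    Confusion c;
    for (std::size_t i = 0; i < truth.size(); ++i) {
        if (truth[i]) {
            if (pred[i]) ++c.tp; else ++c.fn;
        } else {
            if (pred[i]) ++c.fp; else ++c.tn;
        }
    }
    return c;
}

// Guarded division: returns 0 when the denominator is 0.
double ratio(std::size_t num, std::size_t den) {
    return den == 0 ? 0.0 : static_cast<double>(num) / static_cast<double>(den);
}

// Fraction of all instances classified correctly.
double accuracy(const Confusion& c) {
    return ratio(c.tp + c.tn, c.tp + c.tn + c.fp + c.fn);
}

// Fraction of true negatives that were predicted positive.
double falsePositiveRate(const Confusion& c) {
    return ratio(c.fp, c.fp + c.tn);
}

// Fraction of true positives that were predicted negative.
double falseNegativeRate(const Confusion& c) {
    return ratio(c.fn, c.fn + c.tp);
}
```

Reporting the two error rates next to accuracy is useful when one class is much rarer than the other, because accuracy alone can look good while most instances of the rare class are missed.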

2 Algorithm

• Your algorithm should be based on the classification algorithms: KNN, Decision Tree, SVM, Naive Bayes, Logistic Regression.

Usually a straightforward implementation of one method will not lead to satisfactory performance. Your algorithm can be a combination of methods and should incorporate one or more data mining techniques when the situation arises. These techniques include (but are certainly not limited to):

  - Handling an imbalanced dataset

  - Proper imputation methods for missing values

  - Different treatment of various types of features: continuous, discrete, categorical, etc.
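
As a rough illustration of the techniques listed above, the sketch below shows mode imputation for a categorical column, mean imputation for a continuous column, and inverse-frequency class weights as one possible way to counter class imbalance. All function names are hypothetical, and the missing-value markers (`"?"` for categorical values, NaN for continuous ones) are assumptions; check the dataset description for the marker actually used.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Replace missing entries of a categorical column with the most frequent
// observed value (the mode). The "?" marker is an assumption.
// Ties are broken in favor of the lexicographically smallest value.
void imputeMode(std::vector<std::string>& column, const std::string& missing = "?") {
    std::map<std::string, std::size_t> counts;
    for (const auto& v : column)
        if (v != missing) ++counts[v];
    if (counts.empty()) return;  // nothing observed, nothing to impute from
    std::string mode;
    std::size_t best = 0;
    for (const auto& kv : counts)
        if (kv.second > best) { best = kv.second; mode = kv.first; }
    for (auto& v : column)
        if (v == missing) v = mode;
}

// Replace NaN entries of a continuous column with the mean of the
// observed (non-NaN) values.
void imputeMean(std::vector<double>& column) {
    double sum = 0.0;
    std::size_t n = 0;
    for (double v : column)
        if (!std::isnan(v)) { sum += v; ++n; }
    if (n == 0) return;
    const double mean = sum / static_cast<double>(n);
    for (double& v : column)
        if (std::isnan(v)) v = mean;
}

// Inverse-frequency class weights: weight = N / (numClasses * classCount),
// so rarer classes receive larger weights.
std::map<std::string, double> classWeights(const std::vector<std::string>& labels) {
    assert(!labels.empty());
    std::map<std::string, std::size_t> counts;
    for (const auto& y : labels) ++counts[y];
    const double n = static_cast<double>(labels.size());
    const double k = static_cast<double>(counts.size());
    std::map<std::string, double> weights;
    for (const auto& kv : counts)
        weights[kv.first] = n / (k * static_cast<double>(kv.second));
    return weights;
}
```

Imputation statistics should be computed on the training data only and then reused on the test data, so that no information from the test set leaks into training.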

3 Data

You'll be examining the behavior of your model on a dataset from the UCI machine learning lab. The dataset is represented in a standard format, consisting of 3 files. The first file, [url removed, login to view], describes the categories and features of the dataset. It also has some empirical results for your reference. The other two files are [url removed, login to view] and [url removed, login to view], containing the actual data instances, formatted with one instance per line, as follows:

$$
\begin{array}{l}
F_1^1, F_1^2, \ldots, F_1^k, \mathrm{label}_1 \\
F_2^1, F_2^2, \ldots, F_2^k, \mathrm{label}_2 \\
\vdots \\
F_n^1, F_n^2, \ldots, F_n^k, \mathrm{label}_n
\end{array}
$$

where $F_i^j$ and $\mathrm{label}_i$ ($i = 1, \ldots, n$; $j = 1, \ldots, k$) represent the value of the $j$th feature and the class category for the $i$th instance, respectively.
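
A reader for this one-instance-per-line format might look like the sketch below. It assumes the fields are comma-separated (the delimiter is a parameter, so this is easy to change), treats the last field as the class label, and keeps every feature as raw text so that later code can decide how to handle each feature type. The names `Instance`, `trim`, `parseLine` and `parseStream` are hypothetical.

```cpp
#include <cassert>
#include <istream>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

struct Instance {
    std::vector<std::string> features;  // F^1 ... F^k, kept as raw text
    std::string label;                  // class category (last field on the line)
};

// Strip leading and trailing whitespace, including a trailing '\r'.
std::string trim(const std::string& s) {
    const char* ws = " \t\r\n";
    const auto first = s.find_first_not_of(ws);
    if (first == std::string::npos) return "";
    const auto last = s.find_last_not_of(ws);
    return s.substr(first, last - first + 1);
}

// Parse one line into an Instance. Returns false for blank lines or
// lines with fewer than two fields (no feature/label pair).
bool parseLine(const std::string& line, Instance& out, char delim = ',') {
    assert(delim != '\n');  // the delimiter must differ from the line terminator
    std::vector<std::string> fields;
    std::stringstream ss(line);
    std::string field;
    while (std::getline(ss, field, delim)) fields.push_back(trim(field));
    if (fields.size() < 2) return false;
    out.label = fields.back();
    fields.pop_back();
    out.features = std::move(fields);
    return true;
}

// Read all parseable instances from a stream, skipping malformed lines.
std::vector<Instance> parseStream(std::istream& in, char delim = ',') {
    std::vector<Instance> data;
    std::string line;
    while (std::getline(in, line)) {
        Instance inst;
        if (parseLine(line, inst, delim)) data.push_back(std::move(inst));
    }
    return data;
}
```

Passing a `std::ifstream` opened on the training or test file to `parseStream` would yield the instances; a stricter reader could also verify that every line has the same number of fields.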

The data you will be examining was extracted from the census bureau database. Each instance contains an individual's educational, demographic and family information. The prediction task is to determine whether a person makes over 50K a year. You should use [url removed, login to view] to train your classifier and use [url removed, login to view] to evaluate the performance of your learning algorithm.
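
To make the train-then-evaluate workflow concrete, here is a minimal k-nearest-neighbor sketch. It is only an illustration under stated assumptions: features are assumed to be already encoded as numeric vectors, the label is assumed to be 1 for "over 50K" and 0 otherwise, and the names `Sample`, `sqDist`, `knnPredict` and `knnAccuracy` are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

struct Sample {
    std::vector<double> x;  // numeric feature vector (assumed encoding)
    int y;                  // class label: 1 = positive class, 0 = negative (assumed)
};

// Squared Euclidean distance between two equal-length vectors.
double sqDist(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    double d = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) d += (a[i] - b[i]) * (a[i] - b[i]);
    return d;
}

// Predict by majority vote among the k nearest training samples.
// Vote ties go to class 0.
int knnPredict(const std::vector<Sample>& train,
               const std::vector<double>& query, std::size_t k) {
    assert(!train.empty() && k > 0);
    std::vector<std::pair<double, int>> dist;  // (distance, label)
    dist.reserve(train.size());
    for (const auto& s : train) dist.emplace_back(sqDist(s.x, query), s.y);
    k = std::min(k, dist.size());
    std::partial_sort(dist.begin(),
                      dist.begin() + static_cast<std::ptrdiff_t>(k),
                      dist.end());
    std::size_t positives = 0;
    for (std::size_t i = 0; i < k; ++i)
        if (dist[i].second == 1) ++positives;
    return 2 * positives > k ? 1 : 0;
}

// Fraction of samples in `eval` whose predicted label matches the true label.
double knnAccuracy(const std::vector<Sample>& train,
                   const std::vector<Sample>& eval, std::size_t k) {
    assert(!eval.empty());
    std::size_t correct = 0;
    for (const auto& s : eval)
        if (knnPredict(train, s.x, k) == s.y) ++correct;
    return static_cast<double>(correct) / static_cast<double>(eval.size());
}
```

Calling `knnAccuracy(train, train, k)` gives training set accuracy and `knnAccuracy(train, test, k)` gives test set accuracy. With `k = 1` the training accuracy is perfect whenever no two training samples share identical features with different labels, which is one reason the held-out test file is the meaningful measure.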

4 Your Mission...

Deliverables for this project are:

• Code to implement the classification algorithm for the data file formats given above

• A README file, with simple, clear instructions on how to compile and run your code

• Testing statistics for the application of your learning algorithm. At a minimum you should provide training set accuracy and test set accuracy

• A discussion of data mining techniques employed in your algorithm

• A report analyzing the behavior of your algorithm on the dataset, including any unusual or anomalous (in your opinion) behavior

Skills: Algorithm, C++ Programming, Data Mining, Machine Learning

About the Employer:
( 8 reviews ) shanhai, United States

Project ID: #15791812

Awarded to:

Yknox

I am very interested in your project. I am a good C++, Java, Math, ML, and Algorithm expert, and I am quite experienced in these jobs. Let's go ahead; I want to work for you continuously. Relevant Skills and Expe...

$210 USD in 2 days
(511 Reviews)
8.6

11 freelancers are bidding on average $101 for this job
