
DATA SCIENCE, python

€30-250 EUR

Completed
Posted over 8 years ago

Paid on delivery
DATA SCIENCE

1) Download 5 clustering datasets from the UCI Machine Learning Repository ([login to view URL] mIns=&type=&sort=nameUp&view=table).
a) Run the k-means algorithm with different numbers of clusters, including the correct number. Use Euclidean distance as the similarity function. Plot the convergence of the algorithm and the distribution of the data across clusters. Compare the latter with the true distribution of the data across clusters, as given by the creators of the datasets («ground truths»). What do you notice about the value of the objective function as the number of clusters changes? How do you explain it?
b) Repeat the above using Gaussian mixture models, and answer all the corresponding questions from (a). Consider both full and diagonal covariance matrices in your analysis. What differences in performance do you see between the two setups?

2) Based on the relevant demo from the lesson [1], use the 20newsgroups corpus to train a topic model using nonnegative matrix factorization. As in the demo, represent every text by its tf-idf representation, in which each document is described by combining the relative frequency of each token in the text with the inverse of its relative frequency across all the texts of the corpus.
a) Try the algorithm with at least 5 different numbers of latent features. What do you observe regarding convergence and computational cost? To what do you attribute this?
b) Try the algorithm with at least 5 different numbers of inferred topics. What do you observe regarding convergence and computational cost? To what do you attribute this?
c) What happens to convergence and computational cost if we increase the number of samples? To what do you attribute this?

3) Build a system that provides

4) Download the dataset from: https://archive.ics.uci.edu/ml/datasets/UJI+Pen+Characters. Use HMMs trained on these data to build a system that can recognize handwriting. Try HMMs with different numbers of states. What do you notice about convergence, recognition accuracy, and computational cost? How do you interpret these phenomena based on the theory you have been taught?

[1] http://scikit-learn.org/stable/auto_examples/applications/topics_extraction_with_nmf.html#example-applications-topics-extraction-with-nmf-py
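Task 1 a) can be illustrated with a minimal sketch, which is not part of the original posting. It runs Lloyd's k-means algorithm in plain Python on synthetic 2-D data (three Gaussian blobs standing in for a UCI dataset) and records the objective, the sum of squared Euclidean distances to the assigned centroids, after every assignment step. The helper names `sq_dist`, `kmeans`, and `make_blobs`, the toy data, and the seeds are all illustrative assumptions; on the real datasets a library implementation such as scikit-learn's `KMeans` would likely be used instead.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iter=100, seed=0):
    """Lloyd's algorithm. Returns (centroids, labels, objective_history)."""
    rng = random.Random(seed)
    # Initialise centroids with k distinct data points (hypothetical choice).
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    history = []
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j]))
            for p in points
        ]
        # Objective: sum of squared distances to the assigned centroids.
        history.append(
            sum(sq_dist(p, centroids[l]) for p, l in zip(points, new_labels))
        )
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(c) / len(members) for c in zip(*members)]
    return centroids, labels, history


def make_blobs(centers, n_per_center=50, spread=0.5, seed=42):
    """Synthetic data: one Gaussian blob per centre (toy stand-in data)."""
    rng = random.Random(seed)
    return [
        [rng.gauss(cx, spread), rng.gauss(cy, spread)]
        for cx, cy in centers
        for _ in range(n_per_center)
    ]


points = make_blobs([(0, 0), (5, 5), (0, 5)])  # "ground truth": 3 clusters
results = {}
for k in range(1, 7):
    _, labels, history = kmeans(points, k)
    results[k] = history
    print(f"k={k}: iterations={len(history)}, final objective={history[-1]:.2f}")
```

Within a single run the objective never increases from one iteration to the next, which is the convergence behaviour the task asks to plot. Across runs, the final objective generally keeps shrinking as k grows, even past the true number of clusters, so the objective alone does not identify the correct k; a common heuristic is to look for the point where the decrease flattens out.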
Project ID: 8934597

About the project

11 proposals
Remote project
Active 8 yrs ago

Awarded to:
Dear Madam or Sir, I have extensive experience in machine learning and Python. As a mathematician and software developer, I understand both the theoretical and the practical sides of machine learning. I have implemented k-means and the expectation-maximization algorithm for Gaussian mixture models from scratch in Matlab and Python. Moreover, I am familiar with the built-in functions from scikit-learn that should be used here. You will get the Python source code with lots of comments, the plots, and the answers to all questions for each task. I am looking forward to discussing your exact needs in the chat. Best regards
€44 EUR in 3 days
5.0 (7 reviews)
3.7
11 freelancers are bidding on average €195 EUR for this job
Hi, I am a Python software developer. I have been working with Python for the last 3 years and have good knowledge of this technology. I also have a good understanding of data mining and clustering algorithms. Looking forward to your response. Let's chat about this.
€150 EUR in 5 days
4.9 (20 reviews)
5.0
A proposal has not yet been provided
€252 EUR in 3 days
5.0 (12 reviews)
5.2
New to Freelancer, but with a master's degree in machine learning. I've been using pandas, NumPy and scikit for over 3 years now. I hope we will cooperate!
€155 EUR in 3 days
5.0 (15 reviews)
4.3
A proposal has not yet been provided
€200 EUR in 3 days
4.8 (3 reviews)
3.0
I have helped people at Sussex, George Mason, and King's London with their work related to natural language processing, predictive analytics, document classification, etc., and delivered it on time with 100% accuracy. I believe I can help you on time as well. My milestone plan is designed to reduce any risk to you. By the way, your 3rd point is not complete. Please share it; if it's complex, it may change the bid/timeline.
€250 EUR in 10 days
5.0 (2 reviews)
3.0
Hi, I am a computational biologist. I have advanced skills in R and Python and a lot of experience with machine learning algorithms and statistics on biological datasets. Does it have to be done in Python? Happy to help you with your assignment. Cheers, Narendra Meena
€277 EUR in 15 days
5.0 (2 reviews)
1.3
A proposal has not yet been provided
€155 EUR in 3 days
5.0 (1 review)
1.1
A proposal has not yet been provided
€277 EUR in 10 days
0.0 (0 reviews)
0.0
A proposal has not yet been provided
€222 EUR in 6 days
0.0 (0 reviews)
0.0
Dear client, I've read your job description and understood the job. I have experience in the analysis of Yelp reviews. I think that k-means is meant for clustering discrete data and GMM is for continuous data. GMM is somewhat more accurate than k-means on the UCI datasets. TF-IDF is one of the topic models, but it is weaker than LSA and LDA. If you hire me, I will finish it on time. Let's get in touch on Freelancer and discuss some more details. Regards.
€166 EUR in 3 days
0.0 (0 reviews)
0.0

About the client

Limassol, Cyprus
0.0
0
Payment method verified
Member since Nov 18, 2015

Client Verification

Other jobs from this client

Data Science
€30-250 EUR