Network Analysis on Twitter Streams

Create a Python script that does the following:

1. Using the following tweets dataset, identify the most influential users using the PageRank algorithm:

Link: [login to view URL] (Files from 2020/01 to 2020/05)

See attached file for detail.

List the 100 most influential users (user id, user name, PageRank score).

PageRank example implementation script: [login to view URL]


You should use the Spark GraphX or GraphFrames library.

Limit your analysis to tweets in Portuguese (language filter).

To create the network, add a directed edge from each retweeter to the user who wrote the original tweet.
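The network construction described above could be sketched roughly as follows. This is only an illustration under several assumptions: the dataset format is not shown here, so the field names (`lang`, `retweeted_status`, `user.id`, `user.screen_name`) are assumed to follow the Twitter v1.1 tweet JSON layout, the helper names `retweet_edges` and `top_influencers` are hypothetical, and the `graphframes` package is assumed to be available to the Spark session.

```python
def retweet_edges(tweets, lang="pt"):
    """Yield (retweeter_id, retweeter_name, author_id, author_name) tuples.

    Assumes each tweet is a dict shaped like Twitter v1.1 JSON; tweets in
    other languages and tweets that are not retweets are skipped.
    """
    for tweet in tweets:
        original = tweet.get("retweeted_status")
        if tweet.get("lang") != lang or not original:
            continue
        yield (
            tweet["user"]["id"], tweet["user"]["screen_name"],
            original["user"]["id"], original["user"]["screen_name"],
        )


def top_influencers(spark, edge_rows, n=100):
    """Run GraphFrames PageRank and return the n highest-ranked users.

    `spark` is an existing SparkSession. The import is kept local so the
    edge extraction above can be used without Spark installed.
    """
    from graphframes import GraphFrame

    rows = list(edge_rows)
    names = {}  # one display name per user id, so vertex ids stay unique
    for src, src_name, dst, dst_name in rows:
        names[src] = src_name
        names[dst] = dst_name

    vertices = spark.createDataFrame(list(names.items()), ["id", "name"])
    edges = spark.createDataFrame([(r[0], r[2]) for r in rows], ["src", "dst"])

    # Edge direction: retweeter -> original author, as the task requires.
    ranked = GraphFrame(vertices, edges).pageRank(resetProbability=0.15, maxIter=10)
    return (ranked.vertices
            .select("id", "name", "pagerank")
            .orderBy("pagerank", ascending=False)
            .limit(n))
```

Collecting the edges into a Python list is only reasonable for a small sample. For five months of data the same filter would more plausibly be expressed directly on a Spark DataFrame loaded from the files.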

2. Process a Twitter stream to identify the most popular users within a time window.

a) Continually collect the stream using the filter endpoint of the Twitter Streaming API to select tweets from the United Kingdom.

Suggestion: Use the bounding box [-8.6, 49.5, 1.46, 60.5]

b) Apply the exponentially decaying window approach to keep smoothed counts of mentions in the collected tweets, and display the 10 most popular users in (user-defined) intervals of t seconds.

See example tweets and output (10 sec window; some outputs removed) below.
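One common formulation of an exponentially decaying window, sketched here under assumptions: every arriving tweet multiplies all existing scores by (1 - c), adds 1 for each user it mentions, and drops scores that fall below a threshold. The class name `DecayingMentionCounter`, the decay constant `c`, and the threshold value are illustrative choices, not taken from the task description.

```python
class DecayingMentionCounter:
    """Smoothed mention counts that decay with every arriving tweet."""

    def __init__(self, c=0.01, threshold=0.5):
        self.c = c                  # decay constant; smaller means longer memory
        self.threshold = threshold  # scores below this are forgotten
        self.scores = {}

    def update(self, mentions):
        """Process one tweet given the list of user names it mentions."""
        decay = 1.0 - self.c
        # 1. Decay every score currently tracked.
        scores = {user: s * decay for user, s in self.scores.items()}
        # 2. Add 1 for each distinct user mentioned in this tweet.
        for user in set(mentions):
            scores[user] = scores.get(user, 0.0) + 1.0
        # 3. Drop users whose score has fallen below the threshold.
        self.scores = {u: s for u, s in scores.items() if s >= self.threshold}

    def top(self, k=10):
        """Return the k users with the highest smoothed counts."""
        return sorted(self.scores.items(), key=lambda kv: kv[1], reverse=True)[:k]
```

To report every t seconds, a consumer loop could call `update` for each incoming tweet and print `top(10)` whenever t seconds have passed since the last report.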

c) Apply a sentiment classifier to the collected tweets, as they arrive, and keep two counts to track the most popular and unpopular users.
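The two counts could be kept as a pair of decayed dictionaries, one fed by positively classified tweets and one by negatively classified tweets. The sketch below is an assumption-laden illustration: `toy_sentiment` is a placeholder word-list classifier standing in for whatever trained model is actually used, and the names `update_counts`, `liked`, and `disliked` are hypothetical.

```python
import re

# Placeholder lexicons; a real solution would use a trained classifier.
POSITIVE = {"love", "great", "good"}
NEGATIVE = {"hate", "awful", "bad"}


def toy_sentiment(text):
    """Return 'pos', 'neg', or 'neu' based on simple word matching."""
    words = set(re.findall(r"[a-z']+", text.lower()))
    score = len(words & POSITIVE) - len(words & NEGATIVE)
    if score > 0:
        return "pos"
    if score < 0:
        return "neg"
    return "neu"


def update_counts(text, liked, disliked, classify=toy_sentiment, c=0.01):
    """Decay both count dictionaries, then credit the users mentioned in
    `text` to `liked` or `disliked` according to the tweet's sentiment."""
    decay = 1.0 - c
    for counts in (liked, disliked):
        for user in counts:
            counts[user] *= decay

    label = classify(text)
    if label == "pos":
        target = liked
    elif label == "neg":
        target = disliked
    else:
        return  # neutral tweets only contribute decay

    for user in set(re.findall(r"@(\w+)", text)):
        target[user] = target.get(user, 0.0) + 1.0
```

Passing the classifier in as the `classify` argument keeps the counting logic independent of the model, so the placeholder can be swapped for a real classifier without touching the rest.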

Skills: Python, Machine Learning (ML), Spark, Data Mining, Data Analysis


About the Employer:
( 3 reviews ) Aveiro, Portugal

Project ID: #25648373

2 freelancers are bidding on average €14 for this job


I am a data scientist with more than 4 years of experience in machine learning and statistical analysis of data using R and Python. I have worked with the Twitter Dev API multiple times and can help you conduct data cl...

€19 EUR in 1 day
(39 Reviews)

Hello, I am a Python expert and I can complete this job in a short time. Please contact me to discuss more.

€8 EUR in 7 days
(0 Reviews)