Collecting the data from Twitter using the Twitter API and defining keyword based queries
extracting opinion words from Twitter based on input queries
Then save to file Datasets
The following points are examples of noisy data that must be removed.
Data Preprocessing and Normalization
Spam tweets which are tweets that contain advertisements or harmful links
Retweeted tweets, which start by “RT”
Duplicated tweets, which were retrieved more than once.
URLs which started by http:// until the next space,
Opinions unrelated to input queries.
Then save to file CleanData
27 freelancers are bidding on average $36/hour for this job
Hi, I can implement it in PHP. What is the dataset for the "opinion words" ? Do you have it? or it is using any external APIs (besides twitter API, of course). Sorin
hi, I have worked with Twitter API using python. I will use python to do the tasks as you mentioned. What type of Dataset file are we talking about? Lets discuss more. thanks