Introduction:
We are developing an algorithm that reacts to certain value changes. Right now we use pure Python
for the calculations, but we would like to start using data analysis libraries such as NumPy and Pandas
for faster computation. To run multiple tests with different variables, we will need to optimize our code/model
with vectorization or with better data preparation in general.
Entries in the database are approximately 30 seconds apart from each other.
They look like this (showing relevant values only):
[
{
date: '06/18/2019 20:00:00',
data1: Number,
data2: Number,
...
},
{
date: '06/18/2019 20:00:30',
data1: Number,
data2: Number,
...
},
{
date: '06/18/2019 20:01:01',
data1: Number,
data2: Number,
...
},
...
]
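As a starting point, entries in the shape above could be loaded into a Pandas DataFrame indexed by timestamp. This is only a sketch: the sample values and the helper name to_frame are made up for illustration, and the date format (month/day/year, 24-hour clock) is an assumption based on the example dates.

```python
import pandas as pd

# Hypothetical sample records in the shape shown above; the values are made up.
records = [
    {"date": "06/18/2019 20:00:00", "data1": 1.0, "data2": 10.0},
    {"date": "06/18/2019 20:00:30", "data1": 2.0, "data2": 20.0},
    {"date": "06/18/2019 20:01:01", "data1": 3.0, "data2": 30.0},
]


def to_frame(records):
    """Build a DataFrame indexed by parsed timestamps, sorted by time."""
    df = pd.DataFrame.from_records(records)
    # Assumed format: month/day/year with a 24-hour clock.
    df["date"] = pd.to_datetime(df["date"], format="%m/%d/%Y %H:%M:%S")
    return df.set_index("date").sort_index()


df = to_frame(records)
```

Parsing the dates once up front avoids repeating string-to-date conversion inside the backtest loop.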
Current state:
At the start of our algorithm we fetch all the necessary data from the DB into a single array
and iterate through it, comparing each entry with an emulated date (variable, e.g. 15 days from now) that advances 30 seconds on each loop.
This emulates the calculations as if they were running live on that date.
On each loop we make a series of calculations. A backtest of 2 weeks takes on average
8 minutes, and we would like to reduce that time as much as possible.
Main part:
For the algorithm to run correctly we need an array of objects, where each object contains the following fields from each entry:
{
date: (Date object of data gathered),
data1: Number,
data2: Number
}
We group this data into periods (variable, example: 10 periods of data), each spanning a
defined timeFrame (variable, example: periods of 15 minutes each). Into each period we insert
all data where [login to view URL] is within that timeFrame.
For each period we need to calculate the average of data1, data2, and data1 + data2, as well as the highest
value (peak) of data1 and data2, so that each period generates an object like this:
period1: {
avgData1: avg(arrayOfData1),
avgData2: avg(arrayOfData2),
avgData1+Data2: avg(arrayOfData1+Data2),
peakData1: max(arrayOfData1),
peakData2: max(arrayOfData2)
}
period2:...
...
period10:...
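The per-period aggregation described above could be sketched in Pandas as a single groupby instead of a Python loop. Several things here are assumptions not stated in the brief: that the periods are the n_periods consecutive windows ending at the emulated date now, that each window is closed on the left and open on the right, and the names period_stats and avgTotal (standing in for avgData1+Data2) are hypothetical.

```python
import numpy as np
import pandas as pd


def period_stats(df, now, n_periods=10, time_frame="15min"):
    """Aggregate the n_periods windows of length time_frame that end at `now`.

    Assumes df has a sorted DatetimeIndex and columns data1 and data2.
    """
    width = pd.Timedelta(time_frame)
    start = now - n_periods * width
    window = df[(df.index >= start) & (df.index < now)]
    # Integer period number (0 .. n_periods - 1) for every row in the window.
    period = (window.index - start) // width
    total = window["data1"] + window["data2"]
    grouped = window.assign(total=total).groupby(np.asarray(period))
    return grouped.agg(
        avgData1=("data1", "mean"),
        avgData2=("data2", "mean"),
        avgTotal=("total", "mean"),
        peakData1=("data1", "max"),
        peakData2=("data2", "max"),
    )


# Made-up demo data: one entry every 30 seconds for 30 minutes.
index = pd.date_range("2019-06-18 20:00:00", periods=60, freq=pd.Timedelta(seconds=30))
demo = pd.DataFrame({"data1": np.arange(60.0), "data2": 2 * np.arange(60.0)}, index=index)
now = pd.Timestamp("2019-06-18 20:30:00")
stats = period_stats(demo, now, n_periods=2, time_frame="15min")
```

Each row of stats corresponds to one period. Periods with no entries do not appear in the result, so a caller that needs exactly n_periods rows would have to reindex.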
Once we have the averages and peak values of each period, we calculate the collective
averages of all the period results. For example sum(period[avgData1] for period in periods) / [login to view URL],
sum(period[avgData2] for period in periods) / [login to view URL], ...
The final result is an object like this:
{
data1Result: Number,
data2Result: Number,
data1+Data2Result: Number,
data1PeakResult: Number,
data2PeakResult: Number
}
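If the per-period results are held in a DataFrame with one row per period, the collective averages reduce to a single column-wise mean. The sketch below assumes that the peak results are also plain averages of the per-period peaks, which the brief implies but does not state outright. The column name avgTotal and the demo values are hypothetical.

```python
import pandas as pd


def collective_result(periods):
    """Average every per-period column across all periods."""
    means = periods.mean()
    return {
        "data1Result": float(means["avgData1"]),
        "data2Result": float(means["avgData2"]),
        "data1+Data2Result": float(means["avgTotal"]),
        # Assumption: peak results are the mean of the per-period peaks.
        "data1PeakResult": float(means["peakData1"]),
        "data2PeakResult": float(means["peakData2"]),
    }


# Made-up per-period results for two periods.
periods = pd.DataFrame(
    {
        "avgData1": [1.0, 3.0],
        "avgData2": [2.0, 4.0],
        "avgTotal": [3.0, 7.0],
        "peakData1": [5.0, 7.0],
        "peakData2": [6.0, 10.0],
    }
)
result = collective_result(periods)
```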
Problem:
Translate this algorithm to NumPy or Pandas and reduce the computation time for big data analysis.
We have tried putting the data of each period into independent NumPy arrays and then calculating the averages,
but that took longer. Maybe we are not using NumPy as intended.
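One possible explanation, offered as a guess since the original code is not shown, is that creating many small NumPy arrays inside a Python loop costs more than it saves. An alternative sketch keeps each column in one contiguous array and computes all periods at once with np.searchsorted and ufunc.reduceat. It assumes timestamps are sorted epoch seconds and that every period contains at least one entry (reduceat does not handle empty segments the way an average would need). The function name and the avgTotal key are hypothetical.

```python
import numpy as np


def period_stats_numpy(ts, data1, data2, start, width, n_periods):
    """Per-period means and peaks without a Python loop over periods.

    ts: sorted array of epoch seconds; start and width are in seconds.
    """
    # Period boundaries, then the row index where each boundary falls.
    edges = start + width * np.arange(n_periods + 1)
    idx = np.searchsorted(ts, edges, side="left")
    counts = np.diff(idx)
    if np.any(counts == 0):
        raise ValueError("every period must contain at least one entry")
    lo, hi = idx[0], idx[-1]
    # Slice to the covered range so the last segment ends at the last period.
    d1, d2 = data1[lo:hi], data2[lo:hi]
    offs = idx[:-1] - lo
    return {
        "avgData1": np.add.reduceat(d1, offs) / counts,
        "avgData2": np.add.reduceat(d2, offs) / counts,
        "avgTotal": np.add.reduceat(d1 + d2, offs) / counts,
        "peakData1": np.maximum.reduceat(d1, offs),
        "peakData2": np.maximum.reduceat(d2, offs),
    }


# Made-up demo data: one entry every 30 seconds for 30 minutes.
ts = np.arange(0, 1800, 30)
data1 = np.arange(60.0)
data2 = 2 * data1
out = period_stats_numpy(ts, data1, data2, start=0, width=900, n_periods=2)
```

The arrays are built once before the backtest, and each step only needs new boundary indices, so no per-period arrays are allocated.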
I am very happy to bid on your project.
I'd like to work with you.
I read your requirements carefully and I see what you mean.
I have experience with Data Analysis, Data Mining, Data Processing, NumPy, and Python.
I'll complete this project without conditions.
I'm looking forward to meeting you.
$66 USD in 1 day
11 freelancers are bidding on average $172 USD for this job
Hi there.
I just read your job description carefully and I'm very interested in it.
As you can see from my profile, I have gained extensive experience in Python with pandas/NumPy.
I can reduce the processing time for big data with pandas/NumPy.
Let me do your job.
Thank you
Regards
Hi
Nice to meet you.
I have solid experience in Python scripting.
Below are the libraries I have used in past projects:
selenium, pandas, matplotlib, lxml, beautifulsoup, scipy, and other useful libraries.
I have written automation, scraping, and scientific scripts.
So I think I can help you with your project.
Just let me know if you want to start the job.
Regards.
Lian
Dear Sir.
Glad to meet you.
I'm very interested in your job post.
I have full experience in responsive design.
Please hire me.
I'll do my best and you will be pleased with my work.
Thank you.
Relevant Skills and Experience
My Expertise:
Bootstrap CSS
PHP(several Frameworks: Laravel, Codeigniter, Wordpress, Shopify, Magento, Opencart)
HTML5
JS(JQUERY, Angular JS, Vue JS, React)
Python(Django, Flask)
Hi, very nice to meet you!
I have great experience in Python.
If you read my profile page, you will see that I'm a Python expert.
I have 10+ years of experience in Python, especially with Django and Flask for web development, and also selenium, tkinter, numpy, pyqt, cv2, pandas, etc.
I have also built an Instagram b*t with Python.
Your job suits me very well and you'll be satisfied with my work.
I can finish this within a few hours or a few days.
Thanks.
I'm a computer engineering TA with 10+ years of experience.
Experienced with data structures and algorithms, computation theory, discrete math, and database design using Python.
Experienced with Python programming: numpy, scipy, pandas, matplotlib, qt4, tkinter, GUI, signal processing, big data analytics, and machine learning,
in Python 2.7 and 3.6.
Developed megaprojects using Python, and recently completed an optical fiber simulator tool using pyqt5.
Will provide you with fully commented code plus any required documentation.
I am a signal processing Teaching Assistant and also a Computer & Electronics engineer. I know Python and NumPy very well and have used NumPy in many processing projects. I will finish the task for you efficiently and quickly!
Dear
I am a senior software developer with rich experience in developing various applications
using C#, VC, VB.Net, NodeJS, Matlab, Java, and Python.
If you are interested in my proposal, please let me know.
I hope to work with you on this project.
Thanks.
Hi, I am a Data Scientist who has been working in machine learning for the past 3 years. I have done many projects such as time series forecasting, anomaly detection, and many more. I have read your problem statement and I can do it easily and very accurately.