
Algorithm Optimization 3

$250-750 USD

Cancelled
Posted almost 10 years ago

Paid on delivery
Objectives
• We will provide one dataset with one target variable (“Score”), a timestamp, and 24 independent variables. The dataset contains ~55 thousand observations (however, your solution should scale to a much larger dataset).
• The goal is to write at most 6 sets of “greater than” and “less than” restrictions on the independent variables. Each set of restrictions returns a subsample of the dataset, on which we evaluate an objective function.
• Specifically, the objective function is the sum of the target over the observations in the selected subsample. Each query (set of restrictions) has to return at least 10 valid responses.
• In addition, any observation that comes less than 60 seconds after a valid observation in this subsample will be removed. So each query has to return at least 10 responses that are at least 60 seconds apart from each other.
• In other words, your goal in this project is to corner up to 6 regions of the dataset using intervals on the independent variables, and to maximize the density of positive values of the target.

Logistics:
• You can find the dataset in the Excel file “[login to view URL]”.
• You will see that some variables have a version A and a version B (for instance W2 R2). In such cases you can use either one or the other, not both.
• Your restrictions cannot use more decimal places than occur in the observations. For instance, a restriction on W1R1 cannot be 0.015; it must be either 0.01 or 0.02.

Tips:
• Regression analysis, Neural Networks, SVM, and K-clusters will not help you much. These methods classify observations by applying a weighted average of the independent variables. The classification rule has to be applied to the independent variables directly; it cannot be applied to a weighted average of them or to any other function of them.
• Make sure to order the timestamps chronologically.
• A good start could be plotting the density of the independent variables for the subsample of positive target values and for the subsample of negative target values. You can then identify regions with a high density of positive target observations.

Reward/Milestones:
• All accepted bids will be awarded on completion.
• We are going to judge the performance of each bid both inside the sample (milestone 1) and outside the sample (milestone 2). A good performance consists of a high aggregate sum of the target variable.
• After this stage we will ask you to provide details of how you would maintain the existing algorithms (milestone 3) over a much larger dataset, ~500 thousand observations.
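
To make the scoring rule in the objectives concrete, here is a minimal Python sketch of how a single query might be evaluated. It is an illustration under stated assumptions, not the client's scoring code. The names `Obs`, `Query`, `matches`, and `evaluate` are hypothetical. The restrictions are assumed to be strict inequalities, timestamps are assumed to be in seconds, and the 60-second rule is read greedily: after sorting chronologically, an observation is dropped if it comes less than 60 seconds after the last observation that was kept.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Obs:
    """One observation (hypothetical layout): timestamp in seconds, target, variables."""
    t: float
    score: float
    x: Dict[str, float]


# A query maps a variable name to (lower, upper) bounds; None means "no bound".
Query = Dict[str, Tuple[Optional[float], Optional[float]]]


def matches(obs: Obs, query: Query) -> bool:
    """True if the observation satisfies every strict 'greater than' / 'less than' bound."""
    for name, (lo, hi) in query.items():
        value = obs.x[name]
        if lo is not None and not value > lo:
            return False
        if hi is not None and not value < hi:
            return False
    return True


def evaluate(data: List[Obs], query: Query,
             gap: float = 60.0, min_valid: int = 10) -> Optional[float]:
    """Sum of the target over the valid subsample, or None if fewer than min_valid remain."""
    valid: List[Obs] = []
    last_t: Optional[float] = None
    # Walk the observations in chronological order, as the posting advises.
    for obs in sorted(data, key=lambda o: o.t):
        if not matches(obs, query):
            continue
        # Assumed reading of the 60-second rule: drop anything too close
        # to the most recently kept observation.
        if last_t is not None and obs.t - last_t < gap:
            continue
        valid.append(obs)
        last_t = obs.t
    if len(valid) < min_valid:
        return None
    return sum(o.score for o in valid)
```

A full solution would call `evaluate` on each of the up to 6 queries and add up the results; a query that returns `None` would not count as valid. Sorting dominates the cost, so pre-sorting the dataset once would help at the ~500 thousand observation scale.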
Project ID: 5995085

About the project

10 proposals
Remote project
Active 10 years ago

10 freelancers are bidding an average of $688 USD for this job

I am an expert in MATLAB vectorization techniques. With vectorized code, a large dataset can be processed in a fraction of a second, whereas the same program written with loops may take 20-30 minutes, depending on your dataset. I have experience doing this kind of work, so you can definitely rely on me for your project. Regards
$666 USD in 10 days
5.0 (52 reviews)
5.5

A proposal has not yet been provided
$998 USD in 10 days
5.0 (2 reviews)
4.8

Hi, very interesting project. Could you send me the dataset? I'd code this in R if you don't mind. Best regards, marcin
$400 USD in 7 days
4.8 (13 reviews)
4.4

Hi, I know the previous project was awarded to someone else, and since you said you need several people working in parallel, I would like to bid for this project. Please let me know specifically which task you expect me to do. Thank you.
$388 USD in 10 days
4.7 (5 reviews)
4.1

Hi Brother, I am a Data Scientist working in a multinational company. My work is to find hidden patterns in large and complex datasets, using predictive analytics, data mining, machine learning, and statistical techniques to see how the data behaved in the past and, from that, how it will behave in the future. I am currently working on "kaggle" competitions. I also write a column on All Analytics, where my rank is Statistician. I also work on time series analysis using ARIMA and SARIMA models, and on econometric analysis using linear, nonlinear, logistic, and probit models, hypothesis testing, Cox regression for survival analysis, survey analysis, etc. I am also the author of various research papers, one of which relates to medical data; you can find my paper on IEEE Xplore. I use R, SPSS Statistics, SPSS Modeler, Stata, SAS, MATLAB, WEKA, Statistica, and Python for data analysis, machine learning, data mining, and data science. Thanks. Regards, Muhammad Shoaib, Data Scientist
$555 USD in 10 days
4.9 (7 reviews)
3.2

I am a Subject Matter Expert in Mathematics, Statistics, Computer Science, and Physics, and an SEO (search engine optimization) specialist. I worked as a MATLAB and Statistics Consultant for several years for many companies. I undertake SEO projects for websites. I have a Master's Degree in Mathematics from Purdue University, USA. I also provide tutoring, training, exam preparation, and assignment solution help in advanced college-level Mathematics, Statistics, Physics, and Computer Science courses.
$666 USD in 10 days
5.0 (2 reviews)
3.2

Hi, I have more than 14 years of experience and I am an expert in this kind of work. I have completed more than 210 projects. Please look at the feedback left by my employers to learn more about my work. Waiting for your positive response. Thanks.
$500 USD in 20 days
3.6 (1 review)
3.8

A proposal has not yet been provided
$750 USD in 10 days
5.0 (3 reviews)
2.8

About the client

Birmingham, United Kingdom
4.9
26
Payment method verified
Member since Apr 20, 2009

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)