Find Jobs
Hire Freelancers

Data engineer project / python / PostGres/

₹1500-12500 INR

Cerrado
Publicado hace más de 3 años

₹1500-12500 INR

Pagado a la entrega
Website: [login to view URL] Data Schema 1. The link contains json data for various data sources. You would have to scan through and filter any COVID related data 2. Design a schema for every state and store data in the respective tables per state 3. Apply various indexing technique on PostGres to enable fast searching DAG performance and efficiency 2. Concepts of Distributed Computing 3. Your choice of schema design 1 NF, 2 NF, 3NF 4. Utilize any async process while performing any loads 5. How would you scale DAG with increase in data volume 6. Logging and monitoring if any failure happens 7. Object oriented design 1. Use Python 3 for developing the solution 2. Utilize Apache Airflow to design a daily dag that would run every day. 3. Create a task within the dag to iterate through the json and download the locally. 4. Create task to load the files into PostGres Schema 5. Optimize your dag performance by achieving max parallelism locally. You could utilize parallelism for task, dag concurrency, thread pool or max_threads 6. Follow the ETL process of Extract, Transform and Load 7. Each dag task should be independent and should be able to run individually. 8. Implement unit or integration test 9. Containerize your application inside a docker container. Use docker-compose if required
ID del proyecto: 28925163

Información sobre el proyecto

2 propuestas
Proyecto remoto
Activo hace 3 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
2 freelancers están ofertando un promedio de ₹20.556 INR por este trabajo
Avatar del usuario
I am excellent at python and in other languages also. I worked on many projects like websraping, python automation and data analytics... I will work for this project because it's very easy task for me... please discuss your further project over chat..
₹11.111 INR en 7 días
5,0 (6 comentarios)
2,2
2,2
Avatar del usuario
I've experience devploying Airflow platform (Kubernetes executor + Postgresql main repo) to manage analytical processing mainly with pandas. I have read you project description carefully , docker containerization could be used as well instead of dedicated operators as proposed .
₹30.000 INR en 20 días
0,0 (0 comentarios)
0,0
0,0

Sobre este cliente

Bandera de UNITED STATES
Mumbai, United States
0,0
0
Miembro desde dic 16, 2012

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.