Find Jobs
Hire Freelancers

Problem solution for Spark use case

$750-1500 USD

Cerrado
Publicado hace casi 7 años

$750-1500 USD

Pagado a la entrega
1) SO I have 60 Millions usa and canada postals Created dataframe customer_df => good and bad => 60M 2) downloaded some Good postals from internet to filter the customer_df data Created one dataframe good_df => which is good postals => 1M 3) Perfomed Join between customer_df and good_df wiht zipcode to seperate the good values filter_df = good zip [login to view URL](cus_df,zipcode) 4) Then seperated bad data with the below logic bad_df = [login to view URL](filter_df) Now still we can filter bad_df with city names city_df = [login to view URL](bad_df,city) Then did unioin between both df's total_filter = [login to view URL](city_df) it taking 1.30 mints (used spark with 8 node cluster each node 32 gb => spark-submit driver memory -8g and num-executors - 8 and executor-memory- 8g) any other technology or any other tool to clean-up the data within 15 to 20 mints(again customer data is 60M
ID del proyecto: 14803384

Información sobre el proyecto

16 propuestas
Proyecto remoto
Activo hace 7 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
16 freelancers están ofertando un promedio de $1.101 USD por este trabajo
Avatar del usuario
I am a data scientist and have experience with Big Data Technologies like Spark and Hadoop. I also have experience with NoSQL databases like HBase, Cassandra, etc. Previously I have worked with in Spark related projects like - Real Time ClickStream Analysis using Spark Streaming, Twitter Sentiment using SparklyR and others. I also have worked with Messaging Queues like Kafka. I would like to help you. Please provide me more details.
$1.100 USD en 5 días
5,0 (6 comentarios)
4,4
4,4
Avatar del usuario
I propose first analyzing your current algorithm to find the bottleneck and either rewriting your algorithm, reconfiguring your environment or finding other technologies e.g. Impala. Relevant Skills and Experience I have experience working with big data in hadoop clusters using Hive, Apache Pig + Java, Spark and Impala. Proposed Milestones $400 USD - Analysis of current code and environment, list possible solutions in order of priority $850 USD - Test candidate solutions and implement best solution
$1.250 USD en 20 días
5,0 (3 comentarios)
3,2
3,2
Avatar del usuario
Hello, I am 7+ years experienced Big data developer and I understand the job and will provide the desired solution. Please spare some time to discuss further. Relevant Skills and Experience My Key skills are: Java J2EE, HBase, Hive, Pig, Cassandra, Spark, Hadoop, Cassandra, Scala, MongoDB and latest cutting edge technologies. Proposed Milestones $1079 USD - Big data developer
$1.079 USD en 12 días
4,7 (4 comentarios)
3,5
3,5
Avatar del usuario
Hi, We are 5 big data enthusiasts with expertise in core technologies like Hadoop,spark,mongodb,hive,pig,R,etc. All of us have the development experience on platforms like Scala,python and java. Our vision is to deliver best solutions to our clients with great team work and dedication. To know more, kindly check our profile Thanks, Team-UBF
$750 USD en 10 días
4,9 (6 comentarios)
3,4
3,4
Avatar del usuario
Hello Sir... I have a very good experience in Spark & Scala. Please contact me for more details when possible. I look forward to work for you Sir. Best Regards. Relevant Skills and Experience I am a computer science tutor, I teach (among others) Data analysis and Algorithms. Proposed Milestones $750 USD - 1
$750 USD en 15 días
5,0 (2 comentarios)
2,8
2,8
Avatar del usuario
I am new to freelancer but I have been working on field of Big data for more than 3 years. The project description tells you are technical as well. I think the pseudocode can be optimized. Relevant Skills and Experience I have more than 3 years experience on Big data technology like Hadoop, Spark, Cascading, Elasticsearch, Redis, etc. I have worked on several data processing and optimizations problems. Proposed Milestones $833 USD - Project completion using same data and resources (some other technologies can be added) I would like to get data and access to clusters so that I can start working right away.
$833 USD en 10 días
5,0 (1 comentario)
0,4
0,4
Avatar del usuario
I have experience in tuning and debugging Spark jobs for one of the Fortune 6 companies which processes large amount of data.
$750 USD en 10 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Hello, With an experience of 7 Years into Java, 3 Years into Hadoop & 1+ year into Spark, excellent solution is guaranteed. Whats your value for "--master" and "--deploy-mode" in spark-submit command Relevant Skills and Experience Spark, Java, RDD Proposed Milestones $250 USD - Discussion on spark command and showing optimization demo $583 USD - Deliver the entire project Whats your value for "--master" and "--deploy-mode" in spark-submit command ?
$833 USD en 20 días
0,0 (1 comentario)
0,0
0,0
Avatar del usuario
Hello there. I have seen your job posting. I will like to ask some questions. Please come over the chat so we can discuss things. Relevant Skills and Experience All the skills/experience will be discussed/revealed upon chat. Proposed Milestones $625 USD - default $625 USD - default I need to know the technical details of this project. Please provide me my job description.
$1.250 USD en 20 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
I have experience in working with Apache Spark and the manipulate DataFrames and RDD, by means of python
$1.111 USD en 5 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
Hello, i have a lot experience in the field g feel free to ask for my work,............................
$1.500 USD en 20 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
I am a Big Data Engineer certified by Simplilearn Relevant Skills and Experience Big Data Hadoop and Spark Developer Proposed Milestones $750 USD - It will be cleared in 8 days (Only Weekends will be calculated as working day)
$750 USD en 8 días
0,0 (0 comentarios)
0,0
0,0
Avatar del usuario
django PHP Arduino hadoop metatrader web design python machine learning HTML,HTML5 graphic design wordpress Android unity3d Relevant Skills and Experience django PHP Arduino hadoop metatrader web design python machine learning HTML,HTML5 graphic design wordpress Android unity3d Proposed Milestones $1666 USD - full
$1.666 USD en 20 días
0,0 (0 comentarios)
0,0
0,0

Sobre este cliente

Bandera de UNITED STATES
United States
0,0
0
Forma de pago verificada
Miembro desde abr 7, 2017

Verificación del cliente

¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.