Hadoop Developer
Ref No.: 17-00314
Location: San Mateo, California
Position Type: Contract
Start Date: 07/24/2017
Project Summary: Building Data Infrastructure components such as Tracking, Messaging Queues, and Real Time Streaming & Batch Data Pipelines
Responsibilities - In detail
• Play a key developer role in building Data Infrastructure components such as Tracking, Messaging Queues, and Real Time Streaming & Batch Data Pipelines
• Deliver high quality data engineering components/services that are robust and scalable
• Collaborate and communicate effectively with other team members to deliver strong results
• Take a methodical approach to areas such as Data Modeling and Data Quality
• Guide others in the team on architecture, design and quality engineering practices
• Leverage this foundational Data Infrastructure to integrate machine learning & statistical models into real time services and power the BI & Visualization layers
• Work closely with data scientists to assist with feature engineering, model training frameworks, and model deployments at scale
Key Technical Skills
• Big Data technologies such as Hadoop, Amazon EMR, Pig, Hive, Spark, and Redshift
• Expert-level SQL programming knowledge and experience
• One programming language: Python, Java, or Scala
Desired Skills
• Experience with Web Services, API integration, and data exchanges with third parties is preferred
Experience Level: 2+ years