Previous Job
Previous
Data Engineer Lead
Ref No.: 21-00518
Location: New Jersey, New Jersey
Position Type:Contract
Experience building and administering big data and real-time streaming analytics architectures in cloud environments (Preferably in AWS) leveraging technologies such as Hadoop, Spark, S3, EMR, Postgres, Redshift, Airflow, and Hudi
Experience architecting, building and administering large-scale distributed applications
Knowledge of Linux operations including basic commands and shell scripting experience
Familiarity with DevOps methodologies and Continuous Integration/Continuous Delivery within a large scale data delivery environment
Software development experience in least three or more of following languages: Python, Scala, Node.js (Preferably Python 3)
Expertise in usage of SQL for data profiling, analysis and extraction

Responsibilities for Engineers for data platform:

Working collaboratively with other engineers, data scientists, analytics teams, and business product owners in an agile environment
Architect, build and support the operation of our Cloud infrastructure and enterprise data platform
Design robust, reusable and scalable data driven solutions and data pipeline frameworks to automate the ingestion, processing and delivery of both structured and unstructured batch and real-time streaming data
Build data APIs and data delivery services to support critical operational processes, analytical models and machine learning applications
Assist in selection and integration of data related tools, frameworks and applications required to expand our platform capabilities
Understand and implement best practices in management of enterprise data, including master data, reference data, metadata, data quality and lineage