Previous Job
Cloudera Data Engineer
Ref No.: 18-01384
Location: Philadelphia, Pennsylvania
Cloudera Data Engineer in Philadelphia, PA 19103

Interview Logistics:

Demonstrated hands-on experience using technologies like Scala and Python using Spark. Candidate should have extensive experience designing and implementing software solutions in support of data warehousing or data lake.

Healthcare knowledge is a definite plus since some of the data formats are related to standards like HL7.

Required Skills Set:

Years of Experience: 7-9 years of experience

Education Required: Bachelors Degree

This is less of a data science role and more of a data engineering role.

We are looking for a lead developer is familiar working with HBase, Solr and Hive using Spark to manipulate and ingest data inbound to the data lake.

Experience with the architecture of Cloudera or similar Hadoop build is required.

Additional Preferred Skills:

The candidate should have excellent written, and presentation skills, with the ability to translate complex technical ideas in to understandable business terminology.

Knowledge of Python/Scala development for data engineering in Cloudera environment.

Project Description:

Our team supports data ingestion and automation of data movement and transformation in the service of business users and data scientists in the organization. The data is healthcare related and we are processing delimited files, XML, images, structured and unstructured data.

The candidate should be prepared to actively be involved in development as well as designing different data engineering processes.

Physical Environment and Working Conditions:

Must be able to work on a large team.