Job Description :
Summary:The Associate Data Engineer is responsible for operational support of CDP across customers. This position requires strong data engineering and operational skills.
Integrate large datasets from multi-channel client data (web, ad delivery, email, app, ecommerce) into the database and make the data available for analytics and reporting. The CDP, once implemented, drives data driven marketing and personalization at scale for our customers.
Work with advanced data management frameworks and utilities built on big data and open-source technologies including Hadoop, PySpark, Scala, Redshift, MySQL. In addition, deploy and support the platform on AWS, GCS, Azure, and hosted infrastructure.
Ability to learn new database and cloud technologies quickly, analytical thinking, delivering quality results, excellent communication, and ability to work in an Agile environment are fundamental to this role.
Ability to do database development work when Operations come to steady state in a project.
Position Details:
Knowledge, Skills and Experience:

  • 1+ years ofexperience in supportingoperationsin big data technologies (Hadoop,Hive,Spark,Yarnetc.)
  • Knowledge ofPython,PySparkis a must.
  • Working knowledge of RDBMS (MySQL, SQL Server etc.) is a must

  • Proven capability in:
  • Taking ownership of Opsand providing innovative solutions (Automation, reports, alerts etc.)
  • Strong Analytical skills with ability to do RCA of issues and to provide bugfixes.
  • Optional:Knowledge ofJava, Scala, Airflow, Kafkaor other languages/tools.
  • Optional:Knowledge ofNoSQL Databases (Hbase, MongoDB, Cassandra etc.)& cloud platforms(Azure/AWS/GCP)

Responsibilities:

  • ProvideOperational excellence in supporting CDP across customers.
  • Provide ideas and inputs on automatingoperational alerts and metrics.
  • Fine tune processes to optimizeutilization of cloud resources.
  • Strong analyticalskillsalong withability to debug complex code andpin-point code issues.

  • Ability to do code enhancements&bug-fixesforissues arising from operationalfailures.
  • Design and implementQCprocessesto ensure dataquality.
  • Provide feedback and ideas on improvements to the CDP feature set and usability with the aim of continuously improving the implementation processesand results.
  • Ability to work with development teams to contribute towards code development when needed.
  • Create and maintain database artifacts, technical documents (functional specs, design document, data model) for all custom development.

Source link