Develops and maintains scalable data pipelines. Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization. Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
The candidate must have the following qualifications:
The candidate must have the following qualifications:
- Strong background in Data Warehousing and Data Lakes.
- Knowledge of Cloudera Hortonworks, AWS, and the Hadoop ecosystem
- Knowledge of Linux system administration
- Experience debugging issues in unfamiliar systems
- Excellent collaboration and communication skillsProgramming experience would be ideal, especially in R, Python, or Scala
Source link