Introduction
At IBM, work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems If so, lets talk.
Your Role and Responsibilities
Candidate should be comfortable & experienced with the basic concepts & performance tuning of various aspects in Hadoop Implementation, maintenance, monitoring & debugging.

  • Administration support across different clusters implementations i.e. Dev, Test, Prod etc for HDP & upcoming CDP/CDH
  • Monitoring the clusters and component services listed(but not limited to)/deployed in the clusters from single/multiple vendors.
  • Monitoring application jobs on Yarn ‒ management based on KPIs of project/dev/ops teams
  • Deep dive to RCA for all reported issues under HDFS or cluster issues
  • Proactive analysis & recommendation to prevent potential issues.
  • Configuration parameters
    • Installation/configuration of additional HDP supported components on the existing cluster as per the need basis/requirement/upgrade etc
    • SOP creation & revisions of any such documents, for any configuration changes under Hadoop.
  • Patch deployment, SHC, VA,MBSS and Security for CA and Kerborization
  • Troubleshooting and Issue Diagnosis
  • Handholding of any BAU configuration and issue for further troubleshooting
  • Creation and management of alerts related to any component service of HDP
  • Upgrade planning /Implementation and handholding.
  • Planning and implementation of HA and Backup process
  • Administration of Druid/Kafka/WXM & all such layers across Hadoop ecosystem
  • Collaboration with Cloudera support team on ticket resolution.
  • Collaboration with Ops/Project team on Unix related issues translation under Hadoop ecosystem
  • Closure of Trouble tickets raised by Business users / Project Teams
  • Attendance of Daily huddle call with Ops/Project and internal
  • HOT FIX/Bug fixes deployment as per Cloudera release
  • SOP creation and recording for new nodes addition in Clusters
  • Assets reconciliations and coordination with respective team. Assets up-date in tool
  • Collaboration with respective stakeholders.
  • Shift handling and planning.
  • Should take Team building & knowledge sessions to bring up capabilities in internal team.
  • Performance issue handling as per available options and optimization List of services being handled on the Hadoop cluster
  • Recommendation of industries best practice for HDP/CDP for performance tuning bi-monthly to meet SLA
  • Resizing the Clusters to meet SLA under advance planning and implementation.
  • Governance with Cloudera for cluster performance bi-monthly or as decided.

Required Technical and Professional Expertise

  • Ambari Services for Monitoring ( HDFS,YARN, MAPREDUCE)
  • Infra solar
  • Apache Atlas 2.0.0
  • Apache HBase 2.1.6
  • Apache Hive 3.1.0
  • Apache Kafka 2.0.0
  • Apache Knox 1.0.0
  • Druid
  • SmartSense 1.5
  • Grafana etc

Preferred Technical and Professional Expertise

  • Apache Oozie 4.3.1
  • Apache Pig 0.16.0
  • Apache Ranger 1.2.0
  • Apache log search
  • Apache Spark 2.3.2
  • Apache Sqoop 1.4.7
  • Apache TEZ 0.9.1
  • Apache Zeppelin 0.8.0
  • Apache ZooKeeper 3.4.6
  • Perimeters Security (CA Security and Kerberos)

Source link