Ability to support large scale production Hadoop environment in any of the Hadoop distributions.
Proficiency in Designing, Capacity planning and cluster setup for Hadoop
Hadoop operational expertise such as troubleshooting skills, bottlenecks, management of data, users, and job execution, basics of memory, CPU, OS, storage, and network
Experience in any of the Scripting Language (Perl, Shell, Python)
Product knowledge on Hadoop distributions such as Cloudera, Horton work or Map
Product knowledge on Hadoop distributions such as Cloudera, Horton work or Map
Administration, maintenance, control, and optimization of Hadoop capacity, security, configuration, process scheduling, and errors
Development or administration on any NoSQL technologies
Development/scripting experience on configuration management and provisioning tools e.g. Puppet, Chef
Development, implementation or deployment experience on the Hadoop ecosystem (HDFS, MapReduce, Hive, Base)
Analysis and optimization of workloads, performance monitoring, tuning and automation.
Addressing challenges of query execution across a distributed database platform on moderm
Proficiency with at least one of the following: Java, Python or Perl.
Experience in tool integration, automation, configuration management in GIT, Jira platforms
Excellent oral and written communication, presentation skills, analytical and problem-solving skills
Self-driven, ability to work independently as well as a part of a team.