Experience Big data technologies Hadoop, HDFS, Hive, Sqoop, Spark , NoSQL Cloudera Impala.
Experience in integrating disparate data sources such as flat files, databases, xml files and/or unstructured data web services
Extensive experience with data analysis and data engineering (data integration and transformation)
Excellent communication and collaboration skills are required. Ability to work independently and as a key contributor in a distributed team environment
Good to have : Shell scripting
Good to have with Amazon Web Services (S3, EC2, DynamoDB, Data pipeline, Athena, RDS, EMR, Red Shift etc)
Good to have : Understanding of relational databases and data integration technologies, and prior experience with traditional ETL tools (Informatica, Talend etc.)