Data Engineer / India / Mumbai / 2 Years Experience

Required Skills

Analytical Hadoop Data quality big data Monitoring Data architecture

Work Authorization

Citizen

Preferred Employment

Full Time

Employment Type

Direct Hire

education qualification

UG :- - Not Required
PG :- - Not Required

Other Information

No of position :- ( 1 )
Post :- 18th Oct 2022

JOB DETAIL

We are looking for a Big Data Engineer that will work on the collecting, storing, processing, and analysingof huge sets of data. The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them. You will also be responsible for integrating them with the architecture used across the company.

Education Qualification

BE, BTECH, Bsc computer science, Msc, Mtech

Responsibilities

Selecting and integrating any Big Data tools and frameworks required to provide requested capabilities

Implementing ETL process

Monitoring performance and advising any necessary infrastructure changes

Defining data retention policies

Analyze and organize raw data

Build data systems and pipelines

Evaluate business needs and objectives

Interpret trends and patterns

Conduct complex data analysis and report on results

Prepare data for prescriptive and predictive modelling

Build algorithms and prototypes

Combine raw information from different sources

Explore ways to enhance data quality and reliability

Identify opportunities for data acquisition

Develop analytical tools and programs

Collaborate with data scientists and architects on several projects

Skills and Qualifications

Proficient understanding of distributed computing principles

Management of Hadoop cluster, with all included services

Ability to solve any ongoing issues with operating the cluster

Proficiency with Hadoop v2, MapReduce, HDFS

Experience with building stream-processing systems, using solutions such as Storm or Spark-Streaming

Good knowledge of Big Data querying tools, such as Pig, Hive, and Impala

Experience with Spark

Experience with integration of data from multiple data sources

Experience with NoSQL databases, such as HBase, Cassandra, MongoDB

Knowledge of various ETL techniques and frameworks, such as Flume

Experience with various messaging systems, such as Kafka or RabbitMQ

Experience with Big Data ML toolkits, such as Mahout, SparkML, or H2O

Good understanding of Lambda Architecture, along with its advantages and drawbacks

Experience with Cloudera/MapR/Hortonworks

Data Engineer