US Citizen
Green Card
EAD (OPT/CPT/GC/H4)
H1B Work Permit
Corp-Corp
W2-Permanent
W2-Contract
Contract to Hire
Consulting/Contract
UG: Not Required
PG: Not Required
No. of positions: 1
Posted: 2nd Nov 2023
- Experience working with a Hadoop-based or cloud-based big data management environment
- Bash scripting or similar experience for data movement and ETL
- Big data queries in Hive/Impala/Pig/BigQuery (proficiency with BigQuery API libraries for data prep automation is a plus)
- Advanced Python programming including PySpark (Scala is a plus), with strong coding experience and proficiency in Data Studio, Bigtable, and GitHub (Cloud Composer and Dataflow are a plus)
- Basic GCP certification is a plus
- Knowledge of Kubernetes is a plus (or other GCP-native container-orchestration tools for automating application deployment, scaling, and management)
- Basic knowledge of machine learning (ensemble models, unsupervised models), with experience using TensorFlow and PyTorch, is a plus
- Basic knowledge of graph mining and graph data models is a plus
- Understand best practices for data management, maintenance, and reporting, and use that knowledge to implement improvements in our solutions.
What You'll Do:
Build automated Client/AI modules, jobs, and data preparation pipelines by gathering data from multiple sources and systems; integrating, consolidating, and cleansing data; and structuring data and analytical procedures for use by our clients in our solutions.
Perform design, creation, and interpretation of large and highly complex datasets
Consult with internal and external clients to understand the business requirements in order to successfully build datasets and implement complex big data solutions (under senior lead's supervision).
Ability to work with Technology and D&A teams to review, understand and interpret the business requirements to design and build missing functionalities to support the identity and fraud analytics needs (under senior lead's supervision).
Ability to work on the end-to-end interpretation, design, creation, and build of large and highly complex analytics-related capabilities (under senior lead's supervision).
Strong oral and written communication skills, and ability to collaborate with cross-functional partners
Qualifications:
3+ years of professional data engineering or data wrangling experience in
TOP REQUIRED SKILLS:
3+ years of professional experience as a data engineer
3+ years working with Python and SQL.
Experience with state-of-the-art machine learning algorithms such as deep neural networks, support vector machines, boosting algorithms, random forests, etc. is preferred
Experience conducting advanced feature engineering and data dimension reduction in a Big Data environment is preferred
Strong SQL skills in a Big Data environment (Hive/Impala, etc.) are a plus
Things that would stand out on a resume -
1- Master's Degree in Computer Science & Data Science
2- Previous Company - Any Bank, Ecommerce