Required Skills

Data Engineer

Work Authorization

  • US Citizen

  • Green Card

Preferred Employment

  • Corp-Corp

Employment Type

  • Consulting/Contract

education qualification

  • UG :- - Not Required

  • PG :- - Not Required

Other Information

  • No of position :- ( 1 )

  • Post :- 18th Nov 2022

JOB DETAIL

  • Design and build reusable components, frameworks and libraries at scale to support analytics products
  • Design and implement product features in collaboration with business and Technology stakeholders
  • Identify and solve issues concerning data management to improve data quality
  • Clean, prepare and optimize data for ingestion and consumption
  • Collaborate on the implementation of new data management projects and re-structure of the current data architecture
  • Implement automated workflows and routines using workflow scheduling tools
  • Build continuous integration, test-driven development and production deployment frameworks
  • Collaboratively review design, code, test plans and dataset implementation performed by other data engineers in support of maintaining data engineering standards
  • Analyze and profile data for designing scalable solutions
  • Troubleshoot data issues and perform root cause analysis to proactively resolve product and operational issues

Experience:

  • Strong understanding of data structures and algorithms
  • Strong understanding of solution and technical design
  • Has a strong problem solving and analytical mindset?
  • Able to influence and communicate effectively, both verbally and written, with team members and business stakeholders
  • Able to quickly pick up new programming languages, technologies, and frameworks
  • Advanced experience building cloud scalable, real time and high-performance data lake solutions
  • In-depth understanding of micro service architecture
  • Strong understanding of developing complex data solutions
  • Experience working on end-to-end solution design
  • Able to lead others in solving complex problems by taking a broad perspective to identify innovative solutions
  • Willing to learn new skills and technologies
  • Has a passion for data solutions

Required and Preferred Skill Sets:

  • 1 -2 years of hands-on experience in AWS - EMR [Hive, Pyspark], S3, Athena or any other equivalent cloud; Ability to solve complex problems
  • 1-2  years of hands-on experience Spark Batch Processing and some familiarity with Spark Structured Streaming; Ability to solve complex issues
  • 1-2  years’ experience working experience with Hadoop stack dealing huge volumes of data in a scalable fashion
  • 2-3 years of hands-on experience with SQL, ETL, data transformation and analytics functions; Ability to solve complex problems
  • 2-3 years of hands-on Python experience including Batch scripting, data manipulation, distributable packages;  Ability to solve complex problems
  • 2-3 years’ experience working with batch orchestration tools such as Apache Airflow or equivalent, preferable Airflow
  • 2-3 years working with code versioning tools such as GitHub or BitBucket; expert level understanding of repo design and best practices
  • 2-3 years working with deployment automation tools such as Jenkins and familiarity with containerization concepts such as Docker and Kubernetes
  • 2-3 years of hands-on experience designing and building ETL pipelines; expert with data ingest, change data capture, data quality; hand on experience with API development; some exposure to Nifi or Kafka
  • 2-3 years designing and developing relational database objects; knowledgeable on logical and physical data modelling concepts; some experience with Snowflake
  • Preferred 1+ years of experience supporting Tableau or Cognos use cases
  • Familiarity with Agile; working experience preferred

Education: Bachelor's degree in IT or related field or Associates + 6 Years

Company Information