Required Skills

XML, Ab-Initio, Talend, DataStage, Syncsort

Work Authorization

  • US Citizen

  • Green Card

  • EAD (OPT/CPT/GC/H4)

  • H1B Work Permit

Preferred Employment

  • Corp-Corp

Employment Type

  • Consulting/Contract

Education Qualification

  • UG: Not Required

  • PG: Not Required

Other Information

  • No. of positions: 1

  • Post: 31st Dec 2020

Job Detail

Job Title: Data Engineer (minimum 10 years of experience required)

Duration: 12 months

Location: Pittsburgh, PA

Role/Responsibilities:

·         This role is expected to provide scripting and automation horsepower for the Reports & ETL rationalization project.

·         This includes understanding the problem statement, then designing and building scripted solutions for meta-data extraction, XML and JSON data parsing, data manipulation, comparison, fuzzy matching, deduplication, entity resolution, and automation.

Required and Desired Experience/Skills:

·         3-4 years of experience in Python/PySpark design and development, primarily using Python for meta-data extraction, XML and JSON data parsing, data manipulation, comparison, fuzzy matching, deduplication, entity resolution and automation.

·         Strong understanding of and experience with the Python packages used for the above purposes (data parsing, manipulation, comparison, deduplication, automation, etc.), including Pandas and dedupe.

·         Hands-on experience developing optimized, complex SQL.

·         ETL experience with any integration tool, such as Informatica, Ab-Initio, Talend, DataStage, or Syncsort.

·         Experience with Big Data/Hadoop platforms such as Cloudera or Hortonworks.

·         Experience working on data-intensive projects (Data Warehouse preferred).

·         Experience with scripting languages such as Shell and Perl, and with regular expressions (regex).

·         Strong communication skills are a must.

Company Information