US Citizen
Green Card
EAD (OPT/CPT/GC/H4)
H1B Work Permit
Corp-Corp
Consulting/Contract
UG: Not Required
PG: Not Required
No. of positions: 1
Posted: 31st Dec 2020
Job Title: Data Engineer (minimum 10 years of experience required)
Duration: 12 months
Location: Pittsburgh, PA
Role/Responsibilities:
· This role is expected to provide scripting and automation horsepower for the Reports & ETL rationalization project.
· This includes understanding the problem statement, then designing and building solution steps with scripting for metadata extraction, XML and JSON data parsing, data manipulation, comparison, fuzzy matching, deduplication, entity resolution and automation.
Required and Desired Experience/Skills:
· 3-4 years of experience in Python/PySpark design and development, primarily using Python for metadata extraction, XML and JSON data parsing, data manipulation, comparison, fuzzy matching, deduplication, entity resolution and automation.
· Strong understanding of, and experience with, the Python packages used for the above purposes (data parsing, manipulation, comparison, deduplication, automation, etc.), including Pandas and dedupe.
· Hands-on experience developing optimized, complex SQL.
· ETL experience with any integration tool, such as Informatica, Ab Initio, Talend, DataStage or Syncsort.
· Experience with Big Data/Hadoop platforms such as Cloudera or Hortonworks.
· Experience working on data-intensive projects (Data Warehouse preferred).
· Experience with scripting languages such as Shell and Perl, and with regular expressions (regex).
· Strong communication skills are a must.