- Bachelor’s Degree in Science, Technology, Engineering, or Mathematics
- Extensive background with Hadoop, Sqoop, Hive, etc. – 3 to 5 years exp (Mandatory experience)
- Very good SQL experience (Mandatory experience)
- NoSQL experience (Mandatory experience)
- Python (Mandatory experience)
- PySpark (Mandatory Experience)
- Apache Air Flow (Highly preferred)
- GCP (Data Engineering) (Prefer) but okay with other cloud technologies
- Streaming Jobs ( Kafka,)
- Nice to have : Machine Learning, Dashboard experience
BI Engineering Responsibilities
- Responsible for the development, maintenance, and enhancements of BI solutions of varying complexity levels across different data sources like DBMS, File systems (structured and unstructured) on-prem and cloud infrastructure; creates level metrics and other complex metrics; use custom groups, consolidations, drilling, and complex filters
- Demonstrates database skill (Teradata/Oracle/Db2/Hadoop) by writing views for business requirements; uses freeform SQLs and pass-through functions; analyzes and finds errors from SQL generation; creates RSD and dashboard
- Responsible for building, testing and enhancement of BI solutions from a wide variety of sources like Teradata, Hive, Hbase, Google Big Query and File systems; develops solutions with optimized data performance and data security
- Works with business analysts to understand requirements and create dashboard/dossiers wireframes; makes use of widgets and Vitara charts to make the dashboard/dossiers visually appealing
- Coordinates and takes necessary actions from DART side for application upgrades (e.g., Teradata, Workday), storage migration, and user management automation; supports such things as Cluster Management and Project Configuration settings