- Excellent in advance SQL, ETL. Must understand relation and ER diagrams/normal forms. How to design, create, extend/iterate, manipulate, seed with realistic data/data exploration, create and optimize queries, operate at low scale and high scale.
- Experience in creation of data pipelines using Python. Python for data manipulation & transformation (python dictionaries, data frames, data stream, joins of all kinds, outside of SQL IDEs)
- Experience with Data Warehousing Architecture
- Research skills – go figure it out and come back with working model. Pros/cons analysis skills
Good to have - End to end understanding of data center operations & applying optimizations/automations using data science Education – preferred MS/PHD candidate