The associate must know how to design a data pipeline Batch or online. Basically he should be comfortable questions designing a pipeline hybrid or cloud
Cloud Experience
The associate should be comfortable designing and deploying the data stack(Big Table, Cloud BigQuery, Cloud PubSub, Cloud storage) on google cloud or AWS and should have experience.
Data Processing
Skill sets needs are Spark framework on Java. Python is not needed as the codebase is strictly based off java.