Should have a Data Engineering/Data Management background.
Good understanding of data flow and data profiling, with experience in data analysis activities.
Should be able to understand basic data models/entity-relationship models.
Should be able to work independently with the customer to gather requirements and lead the creation of design artifacts with the customer.
Good understanding of data integration/ETL, including creating and interpreting technical specification documents and STTMs.
Good hands-on experience with Python and PySpark for data profiling, ETL data loading, or data processing activities.
Experience in Data Management areas such as Data Quality and MDM. Should be able to understand DQ cleansing rules, data survivorship, match-and-merge functionality, etc.
Knowledge of any MDM tool is good to have but not mandatory.