Experience with big data tools: Hadoop, Spark, Kafka, etc.
Experience with object-oriented/object function scripting languages: Python, Scala, etc.
Experience with relational SQL and NoSQL databases, including Postgres, MongoDB. Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
Experience building and optimizing big data data pipelines, architectures and data sets.
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
Strong analytic skills related to working with unstructured datasets.
A successful history of manipulating, processing and extracting value from large disconnected datasets.
Experience with AWS cloud services: EC2, EMR, RDS, S3
Experience with data pipeline and workflow management tools: Airflow, Nifi etc.
Experience with Data Visualization tools like Power BI, Tableau etc