- 5-10 years of recent hands on experience in data engineering and pipeline development.
- Programming experience, ideally in Python, Scala with Spark, Kafka, and a willingness to learn new programming languages to meet goals and objectives.
- Experience in distributed computing and MapReduce paradigm is a must.
- Understanding Hadoop ecosystem components like HIVE is must.
- Knowledge of data cleaning, wrangling, visualization and reporting using tools like Looker/Quilk.
- Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources.
- Use tools like DBT (data ) and workflows like airflow/Perfect for data transforms and pipeline is a plus.
- Knowledge of data mining, machine learning, natural language processing, or information retrieval is a plus.
- Experience in production support and troubleshooting is a plus.
- Strong knowledge of and experience with statistics.
Regards
Prathap
Senior Technical Recruiter
VDart Inc
Phone: 678-720-5251
Email: prathap.t@vdartinc.com
LinkedIn: www.linkedin.com/in/prathap-sam-632769145
www.vdartinc.com