Data Warehouse Design & Implementation: Design, implement, and optimize data warehouse solutions on platforms such as Teradata, Snowflake, Amazon Redshift, or similar technologies. Ensure efficient data storage, retrieval, and processing.
ETL Development with Informatica: Build and manage ETL processes using Informatica to automate the extraction, transformation, and loading of data into the data warehouse. Ensure transformations implement the agreed business rules.
SQL Query Optimization: Write complex SQL queries to extract, manipulate, and analyze large volumes of data across relational and cloud-based data sources. Optimize queries for performance in large-scale environments.
Data Modeling: Design and implement data models (conceptual, logical, and physical) to ensure data is structured efficiently for both storage and analytics. Collaborate with data architects and analysts to create flexible and scalable data models.
Data Integration: Integrate and consolidate data from multiple sources (e.g., transactional systems, APIs, cloud storage, and flat files) into a centralized data warehouse for analysis and reporting.
Data Pipeline Development: Develop, maintain, and optimize robust data pipelines that can handle large volumes of data and support real-time or batch processing needs.
Performance Tuning & Optimization: Monitor and tune the performance of data pipelines and ETL workflows to keep processing efficient, latency low, and capacity able to scale with data volume.
Data Quality & Governance: Ensure the accuracy, consistency, and reliability of data within the data warehouse by implementing data validation and data quality checks.
Collaboration & Communication: Work closely with data scientists, business analysts, and other technical teams to understand data requirements and deliver solutions that support data-driven decision-making.
Documentation & Best Practices: Maintain comprehensive documentation for ETL workflows, data models, and data integration processes. Follow industry best practices and company standards for data architecture and engineering.