Main Purpose of the role:
- The data modeler designs, implements, and documents data architecture and data modeling solutions, which include the use of relational, dimensional, and NoSQL databases.
- These solutions support enterprise information management, business intelligence, machine learning, data science, and other business interests.
- Role will focus on PepsiCo data modeling, metadata management, data governance support of all Pepsico business subject Areas.
- Role will support data design activities to ensure alignment of key attributes across local, sector, and globally defined attributes.
Accountabilities:
- Governs data design/modeling - documentation of metadata (business definitions of entities and attributes) and constructions database objects, for baseline and investment funded projects, as assigned; supports NAB/PBC Applications, Global MDM, EDW Datalake /Lakehouse Modeling (Accountable)
- Provides and/or supports data analysis, requirements gathering, solution development, and design reviews for enhancements to, or new, applications/reporting (Accountable/Consultation)
- Independently complete designs and database object constructs on any supported platform, including Microsoft Azure, Synapse, Teradata, Sybase, Salesforce, Oracle, and/or DB2 (Accountable)
- Supports assigned project contractors (both on offshore), orienting new contractors to standards, best practices, and tools (Accountable)
- Advocates existing Enterprise Data Design standards; assists in establishing and documenting new standards (Accountable/Consultation)
- Contributes to project cost estimates, working with BRM s to evaluate the size and complexity of the changes or new development (Consultant).
- Implement business and IT data requirements through new data strategies and designs across all data platforms (relational, dimensional, and NoSQL) and data tools (reporting, visualization, analytics, and machine learning).
- Work with business and application/solution teams to implement data strategies, build data flows, and develop conceptual/logical/physical data models
- Define and govern data modeling and design standards, tools, best practices, and related development for enterprise data models.
- Identify the architecture, infrastructure, and interfaces to data sources, tools supporting automated data loads, security concerns, analytic models, and data visualization.
- Hands-on modeling, design, configuration, installation, performance tuning, and sandbox POC.
- Work proactively and independently to address project requirements and articulate issues/challenges to reduce project delivery risks.
- Knowledge on any cloud platforms, specifically Synapse and Azure is added advantage.
The successful candidate will:
- Be responsible for the development of the conceptual, logical, and physical data models, the implementation of RDBMS, operational data store (ODS), data marts, and data lakes on target platforms (SQL/NoSQL).
- Oversee and govern the expansion of existing data architecture and the optimization of data query performance via best practices. The candidate must be able to work independently and collaboratively.
- Primary focus would be to partner with Enterprise Architecture to create Data Models, establish data standards and govern across all development; advocate and assist in governing Enterprise Architecture standards and strategy via data solutions.
- This would include independently analyzing project data needs, identifying and resolving data storage and integration needs/issues and driving opportunities for data reuse; satisfying project requirements.
Qualifications/Requirements
Years of Experience:
Mandatory Technical Skills:
- 5 years experience in Information Technology or Business Relationship Management
- 5 years experience in performing data modeling/data design, hands-on relational, dimensional, and/or analytic experience (using RDBMS, dimensional, NoSQL data platform technologies, and ETL and data ingestion protocols).
- Experience with data warehouse, data lake, and enterprise big data platforms in multi-data-center contexts required.
- Bachelor s degree in Computer Science, MIS, Business Management, or related field.
- Excellent remote collaboration skills.
- Experience working in a matrix organization with diverse priorities.
- Solutions Delivery experience - expertise in system development lifecycle, integration, and sustainability.
- Advanced data modeling or database experience; experience with Data Lake/Lakehouse, Synapse, IDM, ER/Studio and/or Erwin modeling tools.
- Good knowledge of metadata management, data modeling, and related tools.
- Knowledge on Datalake/Cloud Platforms (Azure (synapse)/GCP(Bigquery) etc.), ADF, Databricks, Pipeline Creations is a plus.
- Good understanding of CPG/PepsiCo business processes, data, and EDW/MDM environments.
Mandatory Non-Technical Skills:
- Exceptional written and verbal communication skills along with collaboration and listening skills.
- Ability to work with agile delivery methodologies.
- Ability to ideate requirements design iteratively with business partners without formal requirements documentation.
- Ability to lead teams with effective collaboration.
Differentiating Competencies:
- Ability to work with virtual teams (remote work locations); lead team of technical resources (employees and contractors) based in multiple locations across geographies.
- Lead technical discussions, driving clarity of complex issues/requirements to build robust solutions.
- Strong communication skills to meet with business, understand sometimes ambiguous, needs,
- and translate to clear, aligned requirements.
- Able to work independently with business partners to understand requirements quickly, perform analysis and lead the design review sessions.