Companies/EXL/Data Ontology Architect
EXLEXL

Data Ontology Architect

Noida, Uttar Pradesh, India2 Jul 20260KY9RM
exl/data-ontology-architect

Data Ontology Architect

Job Description

Data Ontology Architect Job Description

Data Ontology Architect

Department: Data
Location: Hybrid / On-site
Experience: 5+

ABOUT THE ROLE

We are seeking a Semantic Data Architect to lead the design and operationalization of our enterprise data governance framework. You will own data cataloging, end-to-end data lineage, and governance policy implementation, ensuring our data assets are trustworthy, discoverable, and compliant across all domains.

Data Catalog | Data Lineage | Data Governance | Metadata Management

KEY RESPONSIBILITIES

  • Design and implement enterprise ontologies, semantic models, taxonomies, and knowledge graphs to support data governance and AI-driven business applications.
  • Define and manage enterprise data lineage, metadata management, data cataloging, and semantic interoperability standards across platforms.
  • Develop governance frameworks for data quality, stewardship, classification, ownership, compliance, and lifecycle management.
  • Architect conceptual, logical, and physical data models aligned with enterprise architecture, governance standards, and business requirements.
  • Design and support scalable data products enabling trusted, reusable, and domain-driven data consumption across analytics and AI platforms.
  • Develop and implement Medallion Architecture data models (Bronze, Silver, Gold layers) for scalable and governed enterprise data platforms.
  • Design and develop Databricks Data Vault solutions including hubs, links, satellites, historization, and lineage tracking for enterprise analytics and governance use cases.
  • Architect semantic data models and ontology frameworks to improve data discoverability, traceability, and contextual understanding.
  • Build and integrate knowledge graphs, metadata repositories, vector databases, and enterprise data platforms for contextual AI and analytics.
  • Collaborate with business, governance, engineering, and AI teams to establish enterprise-wide data standards and domain models.
  • Implement ontology alignment, schema mapping, and master/reference data management across complex enterprise systems.
  • Design and support AI-driven data governance workflows including lineage tracking, policy enforcement, access control, and auditability.
  • Develop agentic AI solutions using frameworks such as LangGraph, AutoGen, and CrewAI to automate metadata enrichment, governance, and workflow orchestration.
  • Ensure observability and monitoring of data and AI systems through lineage tracing, metadata tracking, and operational dashboards.
  • Apply governance and security controls including prompt injection defense, role-based access control, and secure data handling practices.
  • Optimize semantic and governance platforms for scalability, reliability, compliance, and production deployment.
  • Build CI/CD processes for ontology releases, governance workflows, metadata pipelines, and AI deployments.
  • Stay current with emerging trends in data governance, metadata management, semantic web technologies, knowledge graphs, and agentic AI best practices.

REQUIRED SKILLS & EXPERIENCE

  • Minimum 5 years of experience in data management roles with a focus on data governance, ontology, data cataloging, and data lineage.
  • Hands-on experience deploying and operating at least one enterprise data catalog platform (Collibra, Alation, DataHub, OpenMetadata, Purview, or equivalent).
  • Deep expertise in data lineage extraction and representation: column-level, table and system lineage, impact analysis, root-cause tracing across ETL/ELT pipelines.
  • Strong knowledge of data governance frameworks (DAMA-DMBOK, DCAM) and how to operate them in large organizations.
  • Experience with metadata management: technical metadata, operational metadata, business glossaries, and ontology design.
  • Proficiency in Python, SQL, PySpark and familiarity with cloud data platforms (Snowflake, BigQuery, Databricks, Redshift).
  • Experience integrating governance tooling with data pipelines (dbt, Spark, Airflow, Informatica, or equivalent).
  • Strong stakeholder management skills — ability to drive governance adoption with both technical and non-technical audiences.
  • Minimum 2 years of AI engineering experience focused on LLM/agent systems in production.
  • Experience with at least one agent framework (LangChain/LangGraph, AutoGen, CrewAI, Semantic Kernel, or equivalent).
  • Experience with graph databases (Neo4j, Neptune) for lineage storage and traversal.
  • Experience working with XML-based ETL and integration tools such as IBM InfoSphere DataStage, Informatica PowerCenter, and Alteryx for enterprise data integration, transformation, and workflow automation.
  • Strong understanding of OpenLineage standards and lineage frameworks for capturing, tracking, and governing end-to-end data pipeline metadata and lineage across enterprise platforms.

NICE TO HAVE

  • CDMP or equivalent data management certification.
  • Hands-on experience designing agentic architecture: ReAct, plan-and-execute, reflection loops, tool-use patterns.
  • Hands-on experience with Java, Scala, PySpark, and COBOL development.
  • Experience designing and building agentic AI systems (single-agent and multi-agent) with tool usage, memory management, and fallback mechanisms.

WHAT WE OFFER

  • Strategic role with direct impact on the organization’s AI enabled data products and solutions.
  • Competitive salary and flexible working arrangements.
  • Flexible working and modern tooling stack.
Document last updated on: [Date]

Quick Apply

~2 min

Apply through whichever channel suits you best.

CompanyEXL
Departmentna
Posted2 Jul 2026