About Me
I am Wajahatullah Khan, a seasoned Data Architect and Engineer with over a decade of experience specializing in data-centric roles, big data management, cloud services, and enterprise-level data stack implementation. Throughout my career, I have demonstrated a strong ability to lead complex data transformation projects and drive innovation in various industries, including telecommunications, financial services, media, and insurance.
Currently, I am a Data Architect at Pasha Insurance, where I am spearheading the company's data architecture transformation, moving from legacy systems to modern, scalable solutions. My work here has significantly enhanced data maturity, improved decision-making capabilities, and positioned the company as a leader in data-driven practices.
Previously, I held pivotal roles at Afiniti, where I led the deployment and integration of advanced data stacks and the successful transition to distributed systems, drawing on my expertise in designing scalable, cloud-native architectures.
I am passionate about leveraging data to drive business growth and technological advancement. My commitment to excellence is reflected in my continuous efforts to mentor the next generation of data professionals and contribute to the field of data engineering through innovative solutions.
My Services
Data Architecture & Engineering
Design and implement robust data architecture frameworks to enhance data management, scalability, and integration. Utilize advanced cloud-native technologies to streamline data processing and optimize system performance.
AI & Machine Learning Solutions
Develop AI-driven models and machine learning algorithms to enhance decision-making processes and predictive analytics. Specialize in creating tailored solutions that leverage big data for strategic business insights.
Cloud Migration & Optimization
Facilitate seamless migration of legacy systems to modern, scalable cloud infrastructures. Optimize cloud resources to improve operational efficiency and reduce costs, ensuring a smooth transition to cloud environments.
Data Governance & Compliance
Implement comprehensive data governance frameworks to ensure data quality, security, and compliance with national and international regulations. Develop policies and controls that protect sensitive information and uphold privacy standards.
Advanced Analytics & Reporting
Build and maintain sophisticated analytics platforms to support data-driven decision-making. Create custom dashboards and reports that provide real-time insights and enhance strategic planning capabilities.
Big Data Infrastructure Design
Architect and deploy large-scale data infrastructure to handle petabyte-scale datasets. Leverage cutting-edge technologies like Apache Hadoop, Spark, and Kubernetes for efficient data storage and processing.
Education
03/2021 – 08/2023 | Işık University, Istanbul, Turkey
- Achieved a 100% scholarship based on entry test and performance during the interview process.
- Thesis Publication in the International Journal of Scientific and Engineering Research (IJSER), ISSN 2229-5518.
09/2009 – 06/2013 | NUST, Islamabad, Pakistan
Work Experience
Data Architect
- Leading a critical transformation of the company's data architecture, migrating from legacy systems to modern, scalable technology solutions, resulting in a 40% reduction in data processing time.
- Enhancing data maturity and optimizing AI-driven decision-making processes, ensuring the company adheres to both national and international standards.
- Implemented robust data governance, security, and privacy practices, which are critical for maintaining customer trust in the insurance industry, leading to a 50% decrease in data errors and improved overall data accuracy.
Data Architect
- Working for TelevisaUnivision, a leading Mexican-American media company, designing and implementing complex data pipelines on GCP using Composer and BigQuery.
- Focused on optimizing reach and frequency queries to support advanced analytics.
Senior Data Expert
- Spearheaded projects that dramatically improved customer service interactions, positively impacting millions of users worldwide.
- Managed the end-to-end architecture, analysis, and implementation of large-scale big data projects, leveraging cutting-edge technologies to deliver high-impact data models and insights.
- Developed and maintained robust value chains, addressing the challenges of acquiring, evaluating, and distilling data from diverse sources, ensuring its value and effective utilization in critical decision-making processes.
- Mentored and supported a team of data and backend engineers, guiding daily tasks and professional development.
- Collaborated closely with cross-functional teams, actively contributing to the development of assets, offerings, and thought leadership initiatives, fostering a culture of innovation and collaboration within the organization.
- Continuously refined data architecture strategies and methodologies, optimizing data storage, retrieval, and analysis processes to enhance overall data quality, performance, and accessibility.
Data Architect
- Implemented Apache Airflow and Spark, further optimizing data processing pipelines and enhancing the performance of AI-driven products like Afiniti-Decisioning.
- Led the migration from MySQL to Greenplum, reducing data processing time by over 80%, thereby enabling faster and more accurate AI-driven decisions. Managed, architected, and analyzed big data to develop data-driven insights and high-impact data models, driving business growth and innovation.
- Post-migration, the AI-driven platform saw a 25% increase in customer satisfaction scores due to enhanced real-time decision-making capabilities.
- The improvements in AI performance and decision accuracy contributed to a 10% increase in revenue.
- Created value chains to overcome challenges in acquiring, evaluating, and distilling data from multiple sources, ensuring its value and effective use in decision-making processes.
- Extracted strategic insights from diverse data sources, providing a competitive advantage by identifying trends, patterns, and actionable information.
- Collaborated with cross-functional teams, contributing to the development of assets, offerings, and thought leadership initiatives across the organization.
- Continuously refined data architecture strategies and methodologies to optimize data storage, retrieval, and analysis, enhancing overall data quality and accessibility.
- Employed advanced analytics techniques to support data-driven decision-making, facilitating the seamless integration of analytics into business processes.
Lead Data Engineer
- Developed automated, scalable batch data pipelines, enhancing data processing efficiency and reliability.
- Defined new processes, identified optimal tools and technologies for each project, and maintained data integrity through intelligent data sensors.
- Optimized and accelerated data flows to eliminate bottlenecks, collaborating with various teams to adapt to evolving client requirements.
- Developed scalable, distributed big data architectures and data pipelines for clients, providing end-to-end support using Talend.
- Implemented continuous software improvements by analyzing user data and translating insights into product enhancements using Python and Java.
- Deployed Enterprise BI tools, improving data processes and designing automation strategies that ensured high data quality.
Senior Data Engineer
- Played a crucial role in the development of the ID Graph project at Adara, a leading data-driven marketing company, by linking various data sources to create a unified view of customer identities. This project was instrumental in enabling the company to run more targeted and personalized marketing campaigns, which improved customer engagement and business performance.
- Established and maintained strict data privacy and governance standards, contributing to zero compliance violations during the project.
- Implemented data processing workflows on Google Cloud for both real-time and batch processing scenarios.
Senior Software Engineer, Business Intelligence
- Collaborated with Project Managers, Business Analysts, and Developers on diverse projects to clearly define business needs, expectations, timelines, and execution methods, effectively communicating with various business groups.
- Provided data warehousing services to financial institutions using Ingres and PostgreSQL databases.
- Assisted in the design and implementation of data marts and data warehousing applications, extracting data from OLTP systems to staging areas using ELT, and loading data to target databases using ETL processes as per business requirements.
Data Analyst
- Performed data transformations, including roll-up and drill-down analysis, interpreting and analyzing data trends using statistical techniques.
- Analyzed data from multiple sources, optimizing statistical efficiency and quality by developing data collection systems and filtering data to remove outliers.
- Scheduled and executed SSIS packages using SQL Server Agent for efficient data processing.
- Designed SSIS packages to extract data from various sources, such as Access databases, Excel spreadsheets, and flat files, loading data into destination databases for further analysis and reporting.
Database Developer
- Managed the planning and development of design and procedures for metrics reporting, ensuring accurate and insightful data representation.
- Created Stored Procedures, Triggers, Functions, Indexes, Views, Joins, and T-SQL code to support application development and optimization. Managed indexes and statistics, optimizing queries using execution plans for efficient database tuning.