Masood Salman Choudhury

Senior Data Engineer & AI Solutions Architect

Manchester, Manchester, United Kingdom
Masood Salman Choudhury

About

Senior Data Engineer & AI Solutions Architect with 5+ years' experience delivering end-to-end data platforms and intelligent applications across fintech, SaaS, and industrial analytics. Expert in designing scalable data pipelines, building AI-powered systems and Agentic AI with LLMs and machine learning models, and deploying robust cloud-native solutions on AWS, Azure, and GCP.

Experience

  • -

    Netherlands - Remote

    Summary:

    • Senior data and AI lead delivering customer-facing projects end to end: scalable Azure and AWS data platforms, agentic AI and RAG, LLM fine-tuning and serving, full-stack SaaS, and secure cloud operations with strong stakeholder partnership and mentorship.

    Responsibilities:

    • Led customer-facing delivery by partnering with stakeholders to design and deploy innovative end-to-end solutions across Azure and AWS, translating business problems into scalable data, AI, and platform architectures
    • Designed and built large-scale Azure Databricks pipelines using PySpark and SQL to ingest, clean, and transform multi-source datasets (Blob Storage, Cosmos DB); implemented real-time streaming with Kafka; delivered curated Delta Lake layers aligned with Kimball-style dimensional modeling for analytics and ML workloads
    • Led development and deployment of multiple production-grade full-stack AI SaaS platforms on Azure using FastAPI, Next.js, React Native and Expo (Android and iOS), OAuth 2.0, PostgreSQL, and Stripe
    • Architected and deployed multi-stage agentic RAG, chatbot, and automation systems using OpenAI Agent SDK, LangChain, LangGraph, CrewAI, Pinecone, and custom Python pipelines integrating Slack and Google Drive
    • Built ML-ready datasets in Delta Lake, supported downstream model training, and optimized vector search and reranking pipelines for retrieval quality
    • Fine-tuned domain-specific LLMs using Unsloth with LoRA and deployed them into production AI workflows to improve inference accuracy
    • Deployed vLLM and llama.cpp for high-performance LLM serving optimized for low latency and high concurrency
    • Designed CI/CD with GitHub Actions deploying containerized applications to AWS ECS using Docker, Terraform, and infrastructure as code; implemented Zero-Trust security, SSL/TLS, firewall rules, and secure API gateways
    • Implemented centralized logging and monitoring with Prometheus and Grafana for observability
    • Mentored engineers through code reviews, architectural guidance, and engineering best practices
    • Python
    • PySpark
    • SQL
    • Databricks
    • Delta Lake
    • Kafka
    • FastAPI
    • Next.js
    • React Native
    • Expo
    • LangChain
    • LangGraph
    • CrewAI
    • Pinecone
    • PostgreSQL
    • MySQL
    • Docker
    • AWS
    • Azure
    • Terraform
    • GitHub Actions
    • Git
    • Prometheus
    • Grafana
    • Nginx
  • -

    Manchester, United Kingdom - On-Site

    Summary:

    • Architected Kimball-style warehousing and scalable financial ETL on GCP; co-led a FastAPI savings product and analytics supporting security and growth.

    Responsibilities:

    • Architected a Kimball-style star-schema data warehouse using Elasticsearch and BigQuery for real-time KPI dashboards in Kibana
    • Built scalable ETL pipelines with Python, GCP Dataflow, Scrapy, and Apache Airflow processing 10M+ financial rows daily
    • Co-led development of a FastAPI-based Savings Platform processing $2M+ monthly deposits
    • Optimized MongoDB, PostgreSQL, Elasticsearch, and BigQuery for performance
    • Performed anomaly detection and fraud analysis using Pandas
    • Automated data validation workflows ensuring pipeline integrity and uptime
    • Deployed applications using Docker and Kubernetes
    • Built a Random Forest model to identify high-value clients
    • Python
    • Elasticsearch
    • BigQuery
    • Kibana
    • GCP
    • FastAPI
    • MongoDB
    • PostgreSQL
    • Apache Airflow
    • Pandas
    • Docker
    • Kubernetes
  • -

    Guwahati, India - On-site

    Summary:

    • Delivered actionable insights via Tableau dashboards and automated data collection processes.

    Responsibilities:

    • Delivered actionable insights via Tableau dashboards (waterfall/cohort analysis), improving stakeholder decision-making
    • Automated competitor data scraping (Scrapy) and ETL into MySQL, reducing manual effort by 50%
    • Analyzed sales and geographic data for fiber network expansion strategy
    • Python
    • Tableau
    • MySQL
    • Scrapy
    • Pandas

Projects

Education

    University of Liverpool

    MSc Business Analytics and Big Data
    Liverpool, United Kingdom
    Distinction
    Key Modules: Data Mining & Machine Learning, Big Data Analytics, Enterprise Systems with SAP, Digital Business Technology and Management, Digital Strategy

    Asian Institute of Management and Technology

    Bachelor of Business Administration
    Guwahati, India
    First-Class
    Key Modules: Statistics, Mathematics, Production & Operation Management

Certificates

Skills

  • Python
  • Databricks
  • PySpark
  • Delta Lake
  • Langchain
  • Pinecone
  • React Native
  • PostgreSQL
  • MySQL
  • Docker
  • AWS
  • Azure
  • GCP
  • Terraform
  • Git
  • Elasticsearch
  • BigQuery
  • FastAPI
  • MongoDB
  • Apache Airflow
  • Pandas
  • Kubernetes
  • Tableau
  • Scrapy
  • Prometheus
  • Grafana
  • Nginx
  • Kafka
  • Next.js
  • Expo
  • LangGraph
  • CrewAI
  • GitHub Actions
  • MLflow
  • Weaviate
  • Scikit-learn
  • PyTorch
  • Django
  • Go
  • C++