Sabbir Hossain
👋 Hi, I'm Sabbir Hossain

Data Engineer at Bell Canada building scalable data infrastructure.
Former bioinformatics researcher at Johns Hopkins and University of Toronto. Harvard NCRC Plenary Speaker, back-to-back ASM ABRCMS award winner.

Available for Hire
Canadian Citizen
Toronto, Canada
Open to USA Relocation

Who I Am

Data Engineer with a Research Background

Executive Summary
Current Role
Data Engineer @ Bell Canada
BBM Division • DE/AI Team • NTS Platform Owner
Research Background
3+ Years Across 3 Institutions
Johns Hopkins • UofT • 750+ TB Multi-Omics Data
Education
UofT Honours BSc • 3.96 GPA
CS + Bioinformatics Specialist • Immunology Minor
Notable Achievements
🎤 Harvard Plenary Speaker • 🏆 ABRCMS Best Oral • 🏆 ABRCMS Best Poster • 📊 78K+ Records Recovered • 83% Query Optimization • 🧬 8 Novel Biomarkers
Core Competencies
Data Engineering • ETL/ELT Pipelines • Data Warehousing • SQL Optimization • Python • Distributed Systems • Cloud (AWS/GCP) • CI/CD • Machine Learning • Research & Documentation
Actively Seeking Opportunities
Target Roles: Data Engineering • Platform Engineering • Software Engineering
Canadian Citizen • TN Visa Eligible (No Sponsorship Required) • Open to Relocation

TL;DR

  • Data Engineer at Bell Canada (BBM — DE/AI), owning production ETLs and analytical systems on NTS — a multi-team, cross-functional platform
  • Build and maintain data infrastructure, dashboards, visualization layers, and business-critical insights used by multiple internal teams
  • University of Toronto Honours BSc (3.96 Major GPA) — CS + Bioinformatics Specialist
  • Harvard plenary speaker (1 of 12 from 5,000+ applicants)
  • ABRCMS Best Detailed Oral & Best Poster Award Winner (top researcher in division)
  • 3+ years research across University of Toronto and Johns Hopkins
  • Looking for Data, Platform, or Software Engineering roles

I'm a Data Engineer at Bell Canada in the Bell Business Markets (BBM) division, on the Data Engineering and Artificial Intelligence (DE/AI) team, where I architect and productionize mission-critical data pipelines on the Network Ticket Service (NTS) platform. Before going full-time in industry, I spent 3+ years in computational biology research. I graduated from the University of Toronto (St. George Campus) with a 3.96 major GPA in Bioinformatics and Computer Science.

My research at UofT led me to cross-institutional work with Johns Hopkins, and eventually to present at Harvard — where I was selected as 1 of 12 plenary speakers from 5,000+ applicants. I won best presentation awards at ABRCMS in back-to-back years. Along the way, I processed 750+ TB of multi-omics data and realized that data engineering is where I belong — building the infrastructure that makes insights possible.

Data engineering sits at the intersection of software engineering and data science, and I love that. I care about distributed systems, platform engineering, data infrastructure, and creating elegant solutions to complex technical problems. Now I apply that same rigor from research to building enterprise-scale systems.

Canada

Current Location
Canadian Citizen
Based in Toronto, ON
NEXUS Card Holder

United States

Open to Relocation
TN Visa eligible (No sponsorship required)
Open to H-1B / Green Card sponsorship

What I Love

🧠 Learning
Mathematics
💻 Coding
🚀 Space
🧬 Bioinformatics
🤝 Mentoring
🍳 Cooking
📚 Reading
🎮 Gaming
🏃 Fitness

Skills & Tools

Languages

Python
SQL
Java
JavaScript
TypeScript
R
C
Bash

Data & ML

Pandas
PyTorch
TensorFlow
Scikit-learn
Apache Spark
Kafka
Airflow

Cloud & DevOps

AWS
Docker
Kubernetes
Terraform
Jenkins
Git
Linux

Web & Databases

React
Next.js
Node.js
PostgreSQL
MongoDB
Redis
GraphQL

Analytics & Visualization

D3.js
Tableau
Power BI
Jupyter
Excel
Confluence

Methodology & Tools

Jira
Agile
Scrum
Kanban
CI/CD
TDD

Experience

Industry and research roles that shaped how I think about data, from building enterprise-scale pipelines at Bell Canada to processing 750+ TB of multi-omics data across top research institutions.

Industry

Data Engineer

Bell Canada
Jun 2025 - Present

Building and owning the NTS/MS Archway data pipeline for Bell's Network Ticket Service platform, under the Bell Business Markets umbrella serving enterprise and small business customers.

  • Expanded analytical coverage from 1 to 9+ months through systematic root cause analysis
  • Recovered 28,000+ missing records by diagnosing upstream data integrity drift
  • Consolidated 4 enterprise data sources into a unified reporting layer

Research

Bioinformatics Research Assistant

Johns Hopkins University
Sept 2022 - Present

Cross-institutional oncology research integrating 750+ TB of multi-omics data across 3 cancer types. This role grew out of my work at UofT.

  • Selected as 1 of 12 plenary speakers from 5,000+ applicants at Harvard NCRC
  • Won Best Oral Presentation at ABRCMS 2023 and Best Poster at ABRCMS 2024
  • Contributed to identification of 8 novel biomarker candidates

Software Development Research Assistant

University of Toronto
Sept 2019 - Apr 2024

Where my research career began. Built full-stack bioinformatics platforms automating workflows across 7 wet lab teams, which led to the Johns Hopkins collaboration.

  • Eliminated 30+ hours of manual work weekly for researchers
  • Created containerized environments that cut onboarding from days to hours

Impact & Activity

💾 750+ TB • Data Processed
🎤 4 • Conference Presentations
🏆 3 • Presentation Awards
🎓 3.96 • Major GPA
🏛️ 3 • Institutions

Projects

Production systems and open-source tools I've built

🔮

NTS/MS Archway Pipeline @ Bell Canada

End-to-end ETL pipeline integrating 4 enterprise data sources into unified Control Plan reporting. 3-tier architecture processing 150,000+ records in ~20 minutes.

Python • SQL • SAS DI • Teradata • ETL
⏱️

Duration Calculation Engine @ Bell Canada

Stateful Python algorithm computing 3 distinct duration metrics measuring agent work cycles. Two-pass group-by propagation model with zero calculation defects.

Python • Pandas • Algorithm Design
🔍

Data Quality Recovery System @ Bell Canada

Full-stack root cause analysis (RCA) diagnosing systemic data integrity drift. Staged recasts correcting 78,000+ records, expanding analytical coverage by 800%.

SQL • Data Quality • Root Cause Analysis
🦠

Microbiome Explorer

Interactive visualization platform for exploring microbial community data with taxonomic profiling, diversity analysis, and comparative metagenomics tools.

React • D3.js • Python • Bioinformatics
🧬

Bioinformatics Platform @ Johns Hopkins

Open-source full-stack bioinformatics platform with interactive D3.js visualizations, R Shiny dashboards, and real-time WebSocket data streaming. Private deployment at Johns Hopkins.

React • D3.js • R Shiny • Python • Docker
🚀

Portfolio Website

This cyberpunk-inspired portfolio built with Next.js featuring animated backgrounds, custom cursor, and glassmorphism design.

Next.js • React • CSS • Canvas API

Let's Build Something Together

Looking for my next opportunity in data engineering or platform engineering.
