Sabbir Hossain
👋 Hi, I'm

Sabbir Hossain

Data engineer. Backend-minded. Platform-first.

I build data and backend systems that need to hold up in the real world. Bell Canada is the current chapter. Before Bell, I spent years at Johns Hopkins and the University of Toronto building research software, bioinformatics tooling, and data-heavy platforms.

Open to the right role · Canadian citizen · Based in Toronto, Canada · TN-eligible for US roles · Open to US relocation
🎤 Harvard NCRC plenary + poster presenter · 🏆 ABRCMS oral + poster presentation awards · 📝 Lead author on 3 manuscripts under review


Bell

This is the production chapter: NTS ownership, nationally scaled CS Attack RCA work, CSAP delivery, and day-to-day responsibility in a live telecom environment.

Mapped size: 34 facts
Area: Career graph
Fast read: Bell + research

Primary owner of the Network Ticket Service pipeline

CS Attack RCAs for 2 directors, 1 VP, and engineering managers across the country

CSAP pipeline work for directors, engineering managers, BI teams, and 20+ downstream consumers

🧾 Quick read

Start here for the fast read.

I am a Data Engineer at Bell Canada on the Data Engineering & Artificial Intelligence team. Most of my day-to-day is production pipeline ownership, analytics platform work, cross-domain debugging, and making sure the data layer actually holds up.

Right now

Data Engineer at Bell Canada

Bell Business Markets, Data Engineering & Artificial Intelligence team

Research background

Nearly 6 years pre-industry

University of Toronto and Johns Hopkins across software, ML, and bioinformatics

Academic foundation

Honours BSc, 3.96 major GPA

Computer Science + Bioinformatics specialist with an Immunology minor

📌 Why this all fits together

Before Bell, I spent nearly six years building research software across the University of Toronto and Johns Hopkins. I still continue some Hopkins research in my spare time because I genuinely enjoy the work. That path led to Harvard NCRC, oral and poster presentation wins at ABRCMS, 750+ TB of multi-omics data, and three lead-author manuscripts now under review.

The best fit for me right now is primary data engineering work, with a clear path toward data platform engineering and strong overlap with backend or infrastructure-heavy software roles. I like clear abstractions, durable systems, and solving messy technical problems without turning them into a circus.

🛠️ Best fit
Data engineering · Backend systems · Platform thinking · ETL / ELT pipelines · Data warehousing · SQL optimization · Dimensional modeling · Cloud architecture · CI / CD · Technical leadership
✨ Still a human being
🧠 Learning · ➗ Mathematics · 💻 Coding · 🚀 Space · 🧬 Bioinformatics · 🤝 Mentoring · 🍳 Cooking · 📚 Reading · 🎮 Gaming · 🏃 Fitness
Home base

Canada

Canadian citizen. Fully authorized to work in Canada.

Based in Toronto, Ontario. Home base is clear, with flexibility for the right team setup.

NEXUS card holder. Cross-border travel is easy when the work needs it.

US work status

United States

TN visa eligible. No sponsorship track, lottery, or employer immigration cost burden.

Open to US relocation.

Open to long-term paths. H-1B or green card sponsorship is fine if the role grows that way.

💼 Experience

Bell is where I'm focused now. Hopkins and UofT built the depth.

📡 Industry

Data Engineer

Bell Canada

Primary owner of Bell's Network Ticket Service pipeline, with growing ownership across CS Attack RCA work and CSAP delivery for cross-country Bell stakeholders.

78 attributes delivered
78,000+ records recovered
83% query optimization
  • Built and productionized the mission-critical Network Ticket Service data pipeline on Teradata using a three-tier ETL and ELT architecture across staging, warehouse, and analysis layers.
  • Integrated four operational systems including REST API event streams, ERP, billing, and directory services using Python and SAS Data Integration while enforcing data contracts and Kimball-style dimensional modeling patterns.
  • Built a stateful sessionization algorithm in Python to fix event sequencing defects, refactoring a flawed sequential method into a robust two-pass group-by propagation model.
🧬 Research

Bioinformatics Software Development Research Assistant

Johns Hopkins University

Ongoing spare-time oncology research, full-stack bioinformatics platforms, and ML-driven multi-omics analysis on HPC infrastructure.

750+ TB data integrated
8 novel biomarkers
83% load time reduction
  • Reduced analysis load times by 83 percent through optimized caching on a full-stack bioinformatics platform supporting 100+ global researchers.
  • Built the platform using Python, R, JavaScript, and C with microservices architecture, SOLID principles, and Docker containerization.
  • Engineered scalable ETL pipelines processing over 750 terabytes of multi-omics data on HPC clusters, accelerating biomarker discovery by 40 percent.
🎓 Research

Software Development Research Assistant

University of Toronto

The foundation: software platforms, reproducible research tooling, and workflow automation across multiple wet-lab teams.

30+ hours saved weekly
7 research teams
50% setup time reduction
  • Reduced analysis effort by more than 30 hours per week across 7 research teams by engineering full-stack bioinformatics platforms.
  • Built automation using Python, R, C, and Java with object-oriented programming patterns to streamline lab workflows.
  • Owned the full software development life cycle from requirements through deployment and maintenance.
🧱 Projects

Selected systems that show how I build when the work has to ship.

📊 Bell Canada

Enterprise Analytics Platform

78-attribute MicroStrategy analytics platform integrating SmartPath, Maximo, IPACT, and LDAP into one decision surface.

Built derived metrics, conditional formatting, cross-filter interactivity, and a structured migration path from development to production with director sign-off.

🔮 Bell Canada

NTS/MS Archway Pipeline

Three-tier ETL pipeline integrating SmartPath API, Maximo, IPACT, and LDAP into unified Control Plan reporting.

Processes 150,000+ records in roughly 20 minutes across staging, warehouse, and analytics layers using schema-aware loads across DEV, QA, and PROD.

🔍 Bell Canada

Data Quality Recovery System

Full-stack RCA effort that corrected historical data integrity drift and restored analytical confidence.

Executed staged historical recasts correcting 78,000+ records, expanding analytical coverage from 1 month to 9+ months and improving match accuracy to the strongest level since inception.

🧬 Johns Hopkins University

Bioinformatics Platform

Open-source full-stack bioinformatics platform used by researchers for visualization, simulation, and analysis workflows.

Built with React, D3.js, R Shiny, Python, WebSockets, and Docker using microservices architecture and SOLID design principles.

🧪 Johns Hopkins University

Multi-Omics Data Pipeline

Scalable processing pipeline integrating DISQOVER, ENCODE, PCAWG, PRIDE, and TCGA data for cancer biomarker analysis.

Applied SVM-RFE, Random Forest, and HPC workflows to help identify 8 novel biomarkers and accelerate validation timelines.

🧼 Independent build

ProofMark Studio

Hub for the ProofMark document-craft tool line: one catalog of ~50 PDF, text, and publishing utilities built as a single React SPA over a thin FastAPI shell.

Three sibling FastAPI apps (hub, proofmark-pdf, text-cleaner) composed by URL rather than imports, so each surface stays independently editable, deployable, and testable. The hub renders the catalog, routes to each tool, and shares a design system of color tokens and SVG illustrations across tools.

🏆 Third-party proof

Third-party signal that the work actually landed, including plenary, oral, and poster presentations.

2024

Plenary Speaker

National Collegiate Research Conference (NCRC) - Harvard University

Selected as 1 of only 12 plenary speakers from over 5,000 national applicants. Delivered a keynote on integrating transcriptomics and proteomics data for glioblastoma research.

Verify: NCRC 2024 handbook (PDF) · About NCRC (HURA) · Harvard FAS NCRC page

2024

Poster Presentation

National Collegiate Research Conference (NCRC) - Harvard University

Presented computational approaches for cancer biomarker identification using integrated multi-omics analysis and machine learning methodologies.

Verify: NCRC 2024 handbook (PDF) · About NCRC (HURA)

2023

Oral Presentation Award

Annual Biomedical Research Conference for Minoritized Scientists (ABRCMS) - Computational and Systems Biology Division

Awarded top presenter in the Computational and Systems Biology division, selected from 80 oral presenters at a conference with over 3,500 attendees.

Verify: Award certificate (PDF) · ABRCMS 2023 program (PDF) · ABRCMS official site

⚙️ Stack

The tools I actually reach for when the work has to land.

Languages

Core languages for backend, scripting, and analytical work.

Python · SQL · Java · JavaScript · TypeScript · R · C · Bash
Data & ML

ETL pipelines, dimensional modeling, and ML workflows: the data plumbing that has to hold up under real load.

Pandas · PyTorch · TensorFlow · Scikit-learn · Apache Spark · Kafka · Airflow
Cloud & DevOps

Deployment, orchestration, infrastructure, and platform operations.

AWS · GCP · BigQuery · Docker · Kubernetes · Terraform · Git · Linux
Web & Data Stores

Application surfaces, APIs, and data systems.

React · Next.js · Node.js · PostgreSQL · MongoDB · Redis · GraphQL
Analytics & Visualization

BI, charting, and exploratory analysis.

MicroStrategy · D3.js · Tableau · Power BI · Jupyter · Excel
Methodology & Delivery

Delivery discipline, team workflow, and release hygiene.

Jira · Confluence · Agile · Scrum · Kanban · CI/CD · TDD
🎓 Foundation

Strong CS and bioinformatics foundations, plus a lot of range.

Schooling

University of Toronto

Campus: St. George Campus
Degree: Bachelor of Science (Honours)
Graduated: June 2024
Specialist: Computer Science, Bioinformatics & Computational Biology
Minor: Immunology
Major GPA: 3.96 / 4.0
📚 Coursework highlights

Computer Science

CSC108H1 / CSC148H1 / CSC165H1 / CSC207H1 / CSC209H1 / CSC236H1 / CSC263H1 / CSC373H1

Bioinformatics & Computational Biology

BCH441H1 / BCB410H1 / BCB420H1 / BCB330Y1 / BCB430Y1

Mathematics & Statistics

MAT135H1 / MAT136H1 / STA247H1 / STA237H1

Biochemistry & Immunology

BCH210H1 / BCH311H1 / IMM250H1 / IMM340H1 / IMM350H1

📧 Contact

If you're hiring for serious technical ownership, let's talk.

I'm focused on data engineering, platform, backend, and software roles where architecture, reliability, and follow-through actually matter.