Profile

Data-Driven Engineer

Architecting real-time pipelines, building ML models, and transforming data into insights

6+ Dashboards in Prod
137+ Deployments
10K+ Users Across Clusters
New Project Launch

Carbon: How It Works

Carbon is a live product that helps users estimate their carbon footprint, understand where their emissions come from, and prioritize improvements.

STEP 01

Collect Usage Inputs

Users enter operational and activity data such as travel, energy, and resource consumption through a guided flow.

STEP 02

Estimate Emissions

Carbon converts raw activity data into emission estimates using standardized calculation logic and transparent assumptions.

STEP 03

Surface Actionable Insights

A focused dashboard highlights footprint hotspots and reduction opportunities to support faster, data-backed decisions.
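The estimation step (Step 02) can be pictured with a minimal sketch. It assumes each emission estimate is an activity amount multiplied by a per-unit emission factor. The factor values, category names, and the `estimate_emissions` and `hotspots` helpers are hypothetical placeholders for illustration, not Carbon's actual logic or data.

```python
from dataclasses import dataclass

# Hypothetical emission factors in kg CO2e per unit of activity.
# Placeholder values for illustration only, not Carbon's real factors.
EMISSION_FACTORS = {
    "travel_km": 0.2,
    "electricity_kwh": 0.4,
    "water_m3": 0.3,
}


@dataclass
class Activity:
    category: str  # must be a key of EMISSION_FACTORS
    amount: float  # quantity in the category's unit


def estimate_emissions(activities):
    """Sum estimated emissions (kg CO2e) per category."""
    totals = {}
    for activity in activities:
        if activity.category not in EMISSION_FACTORS:
            raise ValueError(f"No emission factor for {activity.category!r}")
        factor = EMISSION_FACTORS[activity.category]
        totals[activity.category] = (
            totals.get(activity.category, 0.0) + activity.amount * factor
        )
    return totals


def hotspots(totals, top_n=1):
    """Return the categories with the largest estimated emissions."""
    return sorted(totals, key=totals.get, reverse=True)[:top_n]


if __name__ == "__main__":
    inputs = [
        Activity("travel_km", 100),
        Activity("electricity_kwh", 50),
        Activity("travel_km", 50),
    ]
    totals = estimate_emissions(inputs)
    print(totals)
    print(hotspots(totals))
```

Keeping the factors in one explicit table is one way to keep the assumptions transparent: every estimate can be traced back to an amount and a named factor.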

Explore the live product

Test the full user flow and see how Carbon turns operational input data into decision-ready sustainability insights.

Featured Projects

Real-world solutions across data engineering, ML, and analytics

ML/Data Science

ML-Powered Resource Prediction System

Built an end-to-end ML system that predicts optimal compute resources for job submissions. Implemented an interactive dashboard with user authentication, text vectorization (TF-IDF, CountVectorizer), clustering (K-means, DBSCAN), and confidence scoring. Migrated from static reporting to dynamic analytics, achieving a 95%+ performance improvement.

95% faster dashboard | 137+ deployments | Production-grade ML pipeline
Python | Shiny | Scikit-learn | Clustering | AWS EC2 | Feature Engineering
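The vectorization, clustering, and confidence-scoring combination named in this project can be sketched with scikit-learn as below. This is an illustration under assumptions, not the production system: the job descriptions are made up, the helper names are hypothetical, and the confidence formula (one minus the ratio of the nearest to the second-nearest centroid distance) is just one simple choice.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer


def fit_job_clusters(job_texts, n_clusters=2, seed=0):
    """Vectorize job descriptions with TF-IDF and cluster them with K-means."""
    vectorizer = TfidfVectorizer()
    features = vectorizer.fit_transform(job_texts)
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    model.fit(features)
    return vectorizer, model


def predict_with_confidence(vectorizer, model, job_text):
    """Assign a job to its nearest cluster and score the assignment.

    Hypothetical confidence: 0.0 when the two closest centroids are
    equally far away, approaching 1.0 as the job sits on its centroid.
    Requires a model with at least two clusters.
    """
    vector = vectorizer.transform([job_text])
    distances = model.transform(vector)[0]  # distance to every centroid
    order = np.argsort(distances)
    nearest, second = distances[order[0]], distances[order[1]]
    confidence = 1.0 - nearest / second if second > 0 else 0.0
    return int(order[0]), float(confidence)


if __name__ == "__main__":
    # Made-up job descriptions, for illustration only.
    jobs = [
        "simulation run large memory",
        "simulation run memory heavy",
        "lint check quick",
        "lint quick syntax check",
    ]
    vectorizer, model = fit_job_clusters(jobs)
    print(predict_with_confidence(vectorizer, model, "simulation memory run"))
```

In a resource-prediction setting, the cluster label could then be mapped to the typical resources used by past jobs in that cluster, with low-confidence assignments flagged for review.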
Data Analytics

Multi-Cluster Monitoring Dashboard

Designed and deployed a production Tableau dashboard monitoring 6+ compute clusters across global locations. Implemented real-time resource allocation tracking, memory utilization analysis, and capacity visualization. Managed the migration to a new database system while preserving data consistency.

6+ clusters monitored | 100% data integrity | Zero downtime DB migration
Tableau | SQL | AWS Athena | Data Validation | Production Monitoring
Data Analytics

Workload Analytics & Optimization Platform

Developed comprehensive workload analyzer tracking job statistics, cluster performance, and resource utilization. Fixed critical bugs (date filtering, obsolete data visibility, labeling). Implemented analytics queries for user tracking across multiple compute clusters.

Fixed 4+ critical bugs | Date filter automation | Multi-cluster tracking
Tableau | SQL | Athena | Data Pipeline | Change Management
ML/Data Science

Project Timeline Prediction Model

Built ML model predicting project delays using project management data. Implemented Recursive Feature Addition (RFA) with Random Forest Classifier. Engineered features (milestone position, task duration, completion percentage) with 4+ stages of cross-validation. Created fallback logic for missing baseline dates.

Multiple feature rankings | Regression & classification models | Production-ready
Python | Random Forest | Feature Engineering | Project Data | SQL
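Recursive Feature Addition with a Random Forest classifier can be sketched as follows. This is one common reading of the technique, not necessarily the procedure used in this project: features are ranked by forest importance, then added one at a time and kept only if the cross-validated accuracy improves. The synthetic data from `make_classification`, the 4-fold setting, and the `recursive_feature_addition` helper are all assumptions for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score


def recursive_feature_addition(X, y, cv=4, min_gain=0.0, seed=0):
    """Greedily add features in importance order while the CV score improves."""

    def make_model():
        return RandomForestClassifier(n_estimators=50, random_state=seed)

    # Rank all features by importance from a forest fitted on the full set.
    importances = make_model().fit(X, y).feature_importances_
    ranking = importances.argsort()[::-1]

    selected, best_score = [], 0.0
    for feature in ranking:
        candidate = selected + [int(feature)]
        score = cross_val_score(make_model(), X[:, candidate], y, cv=cv).mean()
        # Keep the feature only if it improves on the best score so far.
        if score > best_score + min_gain:
            selected, best_score = candidate, score
    return selected, best_score


if __name__ == "__main__":
    # Synthetic stand-in for project management data.
    X, y = make_classification(
        n_samples=200, n_features=8, n_informative=3, random_state=0
    )
    features, score = recursive_feature_addition(X, y)
    print(features, round(score, 3))
```

Raising `min_gain` makes the selection stricter, which tends to produce smaller feature sets at the cost of possibly dropping weakly useful features.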
Data Analytics

Tool Performance Analytics Platform

Conducted exploratory analysis on engineering tool test results from compute and design teams. Integrated with log aggregation systems, created dashboards highlighting performance patterns and bottlenecks. Performed tool analysis extracting actionable insights for optimization.

Tool performance patterns identified | Stakeholder insights | Dashboard ready
Splunk | Python | S3 | Exploratory Analysis | Dashboard Design
DevOps/Infrastructure

Reporting Platform & DevOps Automation

Maintained an enterprise reporting platform with data verification and automated cloud storage management via Power Automate workflows. Fixed cloud analytics data source issues, managed secret key rotation with Lambda, and handled data source tagging and ownership. Optimized data ingestion frequency to reduce costs.

Storage auto-management | Cost optimization | 100% credential security
Tableau Cloud | Power Automate | Lambda | S3 | SQL | DevOps

Side Projects

Personal projects demonstrating technical skills and data analysis expertise

Skills & Expertise

Proficient across modern data stack and cloud platforms

Programming

SQL: 96%
Python: 92%
R / Shiny: 88%
Flask / Web Dev: 78%

BI/Analytics

Tableau: 94%
Tableau Cloud: 90%
Splunk: 85%
Dashboard Design: 91%

Cloud

AWS Athena: 89%
AWS S3: 87%
AWS EC2: 85%
AWS Lambda: 82%

DevOps

Power Automate Workflows: 87%
Data Pipeline Design: 89%
Git & CI/CD: 84%
Database Migration: 85%

ML/Data Science

Feature Engineering: 91%
Random Forest / Clustering: 88%
Scikit-learn: 90%
Data Validation & QA: 93%
Data Engineering: Pipelines & ETL
Analytics: BI & Visualization
Machine Learning: Predictive Models
Cloud DevOps: Infrastructure & Scale

Career Journey

8+ years of progressive experience in data

2024-Present

BI DevOps Engineer

NXP Semiconductors

Leading data analytics, ML, and DevOps initiatives for HPC cluster management and job prediction systems

  • Built and deployed 6+ production dashboards serving 10K+ users across global compute clusters with real-time monitoring
  • Engineered an ML pipeline with feature engineering, achieving a 95% performance improvement using vectorization and clustering algorithms
  • Managed a zero-downtime database migration, ensuring 100% data consistency across analytics products
2023-2024

Data Analyst

Techno Teams

Analytics and data-driven insights for marketing and sales optimization

  • Assessed marketer performance through internal data analysis, achieving a 35% increase in campaign effectiveness
  • Analyzed sales data from a new online store, delivering demographic insights that drove a 27% increase in targeted sales strategies
  • Designed interactive dashboards and reports for key business metrics, improving decision-making efficiency by 40%
2023

Research Intern

ABN AMRO Bank N.V.

Research and stakeholder analysis for business and IT alignment

  • Interviewed 20+ stakeholders and assessed their work procedures to differentiate business and IT perspectives
  • Conducted thematic data analysis to identify patterns and facilitate data-driven decision making
  • Supported strategy planning through comprehensive stakeholder research and insights
2017-2019

Team Lead

Techno Teams

Technical Writing & Content Strategy Department

  • Led a team of 5+ technical writers in creating documentation, user guides, and knowledge base articles for multiple products
  • Established content standards and style guides, improving documentation consistency and reducing review cycles by 30%
  • Mentored junior writers on technical communication best practices, contributing to team skill development and retention

Insights & Articles

Sharing knowledge on data engineering, ML, and analytics

Data Engineering | 12 min read

ACID in Data Engineering: From Simple Examples to Distributed Systems Internals

A practical deep dive into how Atomicity, Consistency, Isolation, and Durability are implemented across databases, lakehouses, and distributed systems.

#ACID #Distributed Systems #Lakehouse #Spark #Transactions

Let's Connect

Open to collaboration, opportunities, and discussing all things data

Email

Reach out for collaboration or opportunities

tanmoy.tanvir001@gmail.com

Schedule a Call

Let's discuss your data challenges

Book a 30-min call

Interested in working together on data projects or want to discuss your pipeline architecture?

Send me an Email