Tools & Resources
Interactive tools, comprehensive guides, and hands-on notebooks for data science, machine learning, and AI development.
Privacy Scanner
Detect and redact personally identifiable information (PII) from text and files. Supports 30+ PII types including emails, SSN, API keys.
Data Validator
Validate manufacturing and supply chain data with 41+ business rules across 7 schemas. Comprehensive or selective validation modes.
Inference Estimator
Estimate inference costs and latency for LLM deployments across different providers and model sizes.
Cost Tracker
Track and compare API costs across OpenAI, Anthropic, and other LLM providers. Optimize your AI spending.
Drift Monitor
Monitor model performance and detect data drift in production ML systems. Get alerts when models degrade.
Decision Flowchart
Interactive decision tree to help you select the appropriate statistical test based on your data type and research question.
Statistical Tests
A comprehensive guide to choosing the right statistical test. Interactive fishbone diagram to navigate parametric vs non-parametric options.
Data Visualization
Learn the principles of effective data visualization. From choosing the right chart type to creating compelling visual narratives.
ML Model Selection
A practical guide to selecting the right machine learning model for your problem. Decision trees, neural networks, and more.
Feature Engineering
Transform raw data into meaningful features. Best practices for numerical, categorical, and text data.
EDA Gapminder
Explore global development data with interactive visualizations. GDP, life expectancy, and population trends from 1952-2007.
House Price Predictor
Seattle/King County house price prediction with ML. Explore 21,613 houses on an interactive map and get instant price estimates.
Exploratory Data Analysis
Exploratory data analysis using the Gapminder dataset. Learn data wrangling, visualization, and insights extraction.
Statistical Tests Practice
Hands-on practice with t-tests, ANOVA, chi-square, and correlation analysis using real datasets.
Linear Regression Deep Dive
From simple to multiple regression. Understand assumptions, diagnostics, and interpretation.
Classification Models
Logistic regression, decision trees, and random forests. Compare model performance and interpret results.
Time Series Analysis
Analyze temporal data with ARIMA, seasonal decomposition, and forecasting techniques. Work with real-world time series datasets.
Prompt Optimizer
Optimize prompts for LLMs with A/B testing and performance metrics. Compare outputs across different prompt variations.