Workflow Element Store

  1. APIs and Data Feeds
  2. Data Generation
  3. Unstructured data (Images / Videos)
  4. Data Logging
  5. WebScraping
  6. Public Datasets
  7. Mobile Applications or IoT Applications
  8. Data Collaboration and Partnerships
  9. Crowdsourcing
  10. Data Pre-existing
  11. Unstructured data (Audio)
  12. Structured Data (Tabular)
  13. Surveys and Questionnaires
  1. MS SQL server
  2. GCP BigQuery
  3. S3
  4. Azure Data Warehouse
  5. NoSQL DB
  6. Azure blob storage
  7. MySQL
  8. GCS
  9. AWS Redshift
  10. RDBMS
  11. PostgreSQL
  12. Oracle DB
  13. Informatica
  1. Interaction Features
  2. Time-Based Features
  3. Data Scaling and Normalization
  4. Domain-Specific Feature Engineering
  5. Auto-Preprocessing libraries
  6. Dealing with Outliers
  7. AutoEDA libraries
  8. Dimensionality Reduction
  9. Data Scaling and Normalization
  10. Binning
  11. Encoding Categorical Variables
  12. Logarithmic Transform
  13. Feature Extraction from Images
  14. Handling Imbalanced Classes
  15. Handling Missing Data
  16. Polynomial Features
  17. Textual Feature Extraction
  18. Handling Time-Series Data
  19. Handling Categorical Data
  20. Dimensionality Reduction
  21. Handling Noisy Data
  22. Feature Selection
  1. Supervised Learning-multiclass classification
  2. Time Series Anaysis
  3. Train-Test Split
  4. Ensemble Techniques
  5. Supervised Learning-binary classification
  6. Unsupervised Learning
  7. Forecasting
  8. Data Partitioning
  9. Blackbox Techniques
  10. Supervised Learning-Regression
  1. Train-Test Split
  2. Data Partition-sequential
  3. Batch Size Selection
  4. Ensemble Methods
  5. Transfer Learning
  6. Hyperparameter Tuning
  7. Learning Rate Scheduling
  8. Gradient Clipping
  9. Batch Normalization
  10. Data Augmentation
  11. Regularization
  12. Early Stopping
  13. Weight Initialization
  14. Cross-Validation
  15. Regular Monitoring and Logging
  1. Evaluation Metrics
  2. Data Partitioning
  3. Model Comparison
  4. Train-Test Split
  5. Performance Visualization
  6. Cross-Validation
  7. External Validation
  8. Hyperparameter Tuning
  9. Model Interpretability
  10. Regularization Techniques
  1. Monitoring and Logging
  2. Feedback Collection
  3. Prediction Logging
  4. Model Versioning
  5. Model Monitoring and Maintenance
  6. Performance Metrics
  7. Concept Drift Detection
  8. Continuous Integration and Deployment (CI/CD)
  9. Data Drift Monitoring
  10. Model Drift
  11. Documentation and API Documentation
  12. Model Health Monitoring
  13. Web APIs - Flask, FastAPI, etc.
  14. Model Serialization
  15. Streamlit
  16. Documentation and Reporting
  17. A/B Testing
  18. Alerting and Notification
  19. Cloud Deployment
  20. Model Retraining and Updating
  21. Edge Deployment
  22. Containerization
  23. Model Registry
  24. Serverless Computing
  25. Security Considerations
  26. Bias and Fairness Assessment
  27. Error Analysis
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry