Questions to Ask
- What are the constraint of the system?
- End devices?
- What is the use of the model?
- Data characteristics
- Size
- Output - Categorical or Continuous
- Labeled - Supervised Learning / Unsupervised Learning / Semi-supervised Learning
- Missing data - Handling Missing Data
- Imbalanced? - Handling Imbalanced Dataset
- Outliers - Handling Outliers
- Which Machine learning to use?
- Machine Learning Algorithm Selection
- Need to be interpretable?
- Online Learning?
- Recommendation system?
- Model Evaluation
- #evaluation
- Positive is more important or negative