Train/dev/test/distribution

Choose a dev set and test set to reflect data you expect to get in fth future and cosindier important to do well on.

Size of the dev and test sets

Traditionally, train/test: 70%, 30%; or train/dev/test:60%, 20%, 20%

Currently, train/dev/test: 99%, 1%, 1% for deep learning algorithm.

Size of test set: set your test set to be going enough to give high confidence in the overall performance of your system.

When to change dev/test sets and metrics?

If doing well on your metric + dev/test set don't correspond to doing well on your application, change your metric and/or dev/test set.

Summary of bias/variance with human-level perforamnce

  • Huan-level error: as a proxy for Bayes error

  • Training error:

    • training error - human -level error = "Avoidable bias"
    • Training bigger modeel
    • Train longer/better optimization algorithms
    • NN architecture/hyperparameters search
  • Dev error :

    • Dev error - Training error = "variance "
    • More data
    • Regularization: L2, dropout, data augmentation,
    • NN architecture/hyperparameters search.

Two fundamental assumptions of supervised learning

  • You can fit the training set pretty well.

  • The training set performance generalizes pretty well to the dev/test set.

Error analysis

Build your system quickly

  • Set up dev/test set and metric

  • Build inital system quicily

  • Use Bias/Variance analytiss & Error analytiss ot prioritize next steps.

Bias/Variance on mismatched training and dev/test sets

General Speech Recognition Rearer Mirror Speech Data
Human Level error "human level" 4% <<< Avoidable error
Error on the examples trained on "training error" 7% <<< Variance
Eroor on the examples not trained on "training-dev error" 10% <<<< Data Mismatch >>>> "Dev/test error" 6%

When transfer learning makes sense in DL?

  • task A and task B have the same input X
  • you have a lot of data for task A than task B
  • low level features from A could be helpful for task B

results matching ""

    No results matching ""