https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf
1-1. Upsides of this approach
- Capable of learning generic image features useful for making predictions about unknown class distributions even when very few examples are
available. - Easily trained using standard optimization techniques on pairs sampled
from the source data. - Provide a competitive approach that does not rely upon domain-specific
knowledge by instead exploiting deep learning techniques.
1-2. Learning Strategy
- Learn a neural network that can discriminate between the class-identity
of image pairs (standard verification task) - Output of model is the probability that input images are belong to the
same class.
1-3. Test Phase
- Along the given one images for each novel classes and given test
images for evaluation, evaluate probability that given each test image
are belong to the same class for each novel classes. - Predict by class with the highest probability.
2-1. Model Architecture
- Siamese Network: Use two identical networks that shares weights, and measure distance between embeddings to calculate similarity between them.
- Verification Stage: Train a ConvNet so that it successfully outputs appropriate embedding of input image.
- Classification Stage: Given K-way 1-Shot Classifcation Problem, classify a image with highest similarity.
2-2. Prediction Vector
- Prediction is sigmoid output of weighted L1-Distance
2-3. Loss Function
- L1-Regularized BCE Loss
2-4. Optimization
- Gradient Descent with Momentum and Regularizer
- Momentum Learning Schedule
- Weight Intialization with Normal Distribution
- Hyperparameter Optimization
- Affine Distortion
3-1. Experiments
- Used Omniglot dataset, a dataset containing 1623 characters from 50 different alphabets,
each one hand-drawn by a group of 20 different people.
- Hierarchical Bayesian Program Learning (HBPL), needs information of stroke order. Unlike HBPL, Convolutional Siamese Net does not need any domain knowledge.
- Tried experiment of genearlization to MNIST dataset, while learned only Omniglot Dataset.