The Softmax output function transforms a previous layer's output into a vector of probabilities. It is commonly used for multiclass classification. Given an input vector $x$ and a weighting vector $w$ we have:
$$ P(y=j \mid{x}) = \frac{e^{x^{T}w_{j}}}{\sum^{K}_{k=1}e^{x^{T}wk}} $$
Paper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Language Modelling | 47 | 6.37% |
Retrieval | 30 | 4.07% |
Semantic Segmentation | 29 | 3.93% |
Question Answering | 26 | 3.52% |
Large Language Model | 25 | 3.39% |
Image Segmentation | 14 | 1.90% |
Object Detection | 14 | 1.90% |
In-Context Learning | 12 | 1.63% |
Sentence | 11 | 1.49% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |