Sparsemax: When Softmax Goes on a Diet
This is a great paper to look into if your classification problem involves a huge number of classes:
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification
https://arxiv.org/pdf/1602.02068.pdf
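
For a quick feel of what the paper proposes, here is a minimal sketch of the sparsemax forward pass in plain Python. Sparsemax is the Euclidean projection of the score vector onto the probability simplex, so unlike softmax it can assign exactly zero probability to low-scoring classes. The function name `sparsemax` and the list-based interface are my own choices for illustration, not code from the paper, and this sketch skips batching and gradients.

```python
def sparsemax(z):
    """Sketch of sparsemax: project scores z onto the probability simplex."""
    z_sorted = sorted(z, reverse=True)

    # Find the support size k: the largest j with 1 + j * z_(j) > sum of the top-j scores.
    running_sum = 0.0
    k = 0
    support_sum = 0.0
    for j, z_j in enumerate(z_sorted, start=1):
        running_sum += z_j
        if 1.0 + j * z_j > running_sum:
            k = j
            support_sum = running_sum

    # Threshold tau shifts the scores so the surviving entries sum to 1.
    tau = (support_sum - 1.0) / k

    # Everything at or below the threshold is clipped to exactly zero.
    return [max(z_i - tau, 0.0) for z_i in z]


print(sparsemax([2.0, 1.0, 0.1]))  # [1.0, 0.0, 0.0]
```

With scores `[2.0, 1.0, 0.1]` softmax would still give every class some mass, while this sketch puts all of it on the first class. When the scores are closer together, for example `[0.5, 0.3, 0.1]`, all three classes stay in the support and the output is dense.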