Implemented as a Multi-Layer Perceptron to classify handwritten Digits (0-9)

[Annotated Notebook]

Model Architecture and Results

The model comprises a flattening layer and three linear layers ((256, 64) hidden dimensions) with relus to approximate non-linearity. It achieves 95.6% accuracy after 15 training epochs and batch size = 64. Taining and Test MNIST datasets are loaded with PyTorch dataloaders.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .