This manuscript investigates the impact of weight initialization on the training efficiency and performance of deep learning models, focusing on a specific neural network architecture applied to the MNIST dataset of handwritten digits. Appropriate weight initialization is critical for rapid convergence and strong generalization, both of which underpin effective learning of complex data patterns. The study evaluates several initialization methods, namely random, Xavier/Glorot, and He initialization, within a network consisting of a flatten layer, a dense layer of 128 neurons with ReLU activation, and a final dense output layer. The evaluation is grounded in the theory behind each strategy and assesses its effect on the training process and the resulting model performance. Through this analysis, the research aims to clarify how these initialization techniques influence convergence speed and overall accuracy on image-recognition tasks such as MNIST, as illustrated by the sketch below. By combining empirical observations with theoretical insights, the study offers guidance for the strategic selection of weight initialization methods, thereby optimizing the training and effectiveness of deep learning models.
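
As a concrete illustration of the setup described above, the following minimal sketch builds the flatten → dense(128, ReLU) → dense output network with each of the three initialization schemes under study. It assumes a Keras/TensorFlow implementation, a 10-class softmax output, the Adam optimizer, and a 5-epoch training budget; none of these details are specified in this overview and they stand in only as plausible defaults.

```python
# Minimal sketch comparing the three initialization schemes on MNIST.
# Assumptions (not stated in the text): Keras/TensorFlow, softmax output,
# Adam optimizer, 5 training epochs.
import tensorflow as tf


def build_model(initializer):
    """Flatten -> Dense(128, ReLU) -> Dense(10) classifier for 28x28 digit images."""
    return tf.keras.Sequential([
        tf.keras.layers.Flatten(input_shape=(28, 28)),
        tf.keras.layers.Dense(128, activation="relu",
                              kernel_initializer=initializer),
        tf.keras.layers.Dense(10, activation="softmax",
                              kernel_initializer=initializer),
    ])


# The three weight initialization strategies evaluated in the study.
initializers = {
    "random_normal": tf.keras.initializers.RandomNormal(stddev=0.05),
    "xavier_glorot": tf.keras.initializers.GlorotUniform(),
    "he": tf.keras.initializers.HeNormal(),
}

# Load and scale MNIST to [0, 1].
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

for name, init in initializers.items():
    model = build_model(init)
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    history = model.fit(x_train, y_train, epochs=5,
                        validation_data=(x_test, y_test), verbose=0)
    print(f"{name}: final val_accuracy = {history.history['val_accuracy'][-1]:.4f}")
```

Tracking the per-epoch validation accuracy in `history` is one simple way to compare convergence speed across initializers, in the spirit of the analysis described above.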