Towards Efficient Convolutional Neural Architecture Design
Please use this identifier to cite or link to this resource:
https://doi.org/10.48693/113
Title: | Towards Efficient Convolutional Neural Architecture Design |
Author(s): | Richter, Mats L. |
Author's ORCID: | https://orcid.org/0000-0002-0991-3047 |
First reviewer: | Prof. Dr. Gunther Heidemann |
Second reviewers: | Prof. Dr. Julius Schöning; Prof. Dr. Dimitris Pinotsis |
Abstract: | The design and adjustment of convolutional neural network architectures is an opaque and mostly trial-and-error-driven process. The main reasons for this are the lack of proper paradigms, beyond general conventions, for the development of neural network architectures and the lack of effective insights into the models that can be propagated back to design decisions. For the task-specific design of deep learning solutions to become more efficient and goal-oriented, novel design strategies need to be developed that are founded on an understanding of convolutional neural network models. This work develops tools for the analysis of the inference process in trained neural network models. Based on these tools, characteristics of convolutional neural network models are identified that can be linked to inefficiencies in predictive and computational performance. Based on these insights, this work presents methods for effectively diagnosing these design faults before and during training with little computational overhead. These findings are empirically tested and demonstrated on architectures with sequential and multi-pathway structures, covering all the common types of convolutional neural network architectures used for classification. Furthermore, this work proposes simple optimization strategies that allow for goal-oriented and informed adjustment of the neural architecture, opening the potential for a less trial-and-error-driven design process. |
URL: | https://doi.org/10.48693/113 https://osnadocs.ub.uni-osnabrueck.de/handle/ds-202205106814 |
Keywords: | Deep Learning, Neural Architecture Design, Computer Vision, Convolutional Neural Networks |
Date of publication: | 10-May-2022 |
License name: | Attribution 3.0 Germany |
License URL: | http://creativecommons.org/licenses/by/3.0/de/ |
Publication type: | Dissertation or Habilitation [doctoralThesis] |
Appears in collections: | FB08 - E-Dissertationen |
Files in this resource:
File | Description | Size | Format | |
---|---|---|---|---|
thesis_richter.pdf | Presentation format | 43.43 MB | Adobe PDF | thesis_richter.pdf View/Open |
This resource is published under the following copyright terms: Creative Commons license