SciELO - Scientific Electronic Library Online

 
vol.32 número2Benign interpolation of noise in deep learningHt-index for empirical evaluation of the sampled graph-based Discrete Pulse Transform índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Artigo

Indicadores

Links relacionados

  • Em processo de indexaçãoCitado por Google
  • Em processo de indexaçãoSimilares em Google

Compartilhar


South African Computer Journal

versão On-line ISSN 2313-7835
versão impressa ISSN 1015-7999

Resumo

DAVEL, Marelie H.. Using Summary Layers to Probe Neural Network Behaviour. SACJ [online]. 2020, vol.32, n.2, pp.102-123. ISSN 2313-7835.  http://dx.doi.org/10.18489/sacj.v32i2.861.

No framework exists that can explain and predict the generalisation ability of deep neural networks in general circumstances. In fact, this question has not been answered for some of the least complicated of neural network architectures: fully-connected feedforward networks with rectified linear activations and a limited number of hidden layers. For such an architecture, we show how adding a summary layer to the network makes it more amenable to analysis, and allows us to define the conditions that are required to guarantee that a set of samples will all be classified correctly. This process does not describe the generalisation behaviour of these networks, but produces a number of metrics that are useful for probing their learning and generalisation behaviour. We support the analytical conclusions with empirical results, both to confirm that the mathematical guarantees hold in practice, and to demonstrate the use of the analysis process.CATEGORIES: Computing methodologies ~ Neural networks Theory of computation ~ Machine learning theory

Palavras-chave : deep learning; machine learning; learning theory; generalisation.

        · texto em Inglês     · Inglês ( pdf )

 

Creative Commons License Todo o conteúdo deste periódico, exceto onde está identificado, está licenciado sob uma Licença Creative Commons