Review of advances in neural networks: Neural design technology stack
Section snippets
Introduction and background
Recently, we have witnessed increasing interest in understanding how the brain functions and how it can be modeled. This momentum is fueled by major research initiatives, including the Human Brain Project [44] and the BRAIN Initiative, as well as significant commercial efforts such as IBM Watson [20].
In this paper, we provide a high-level overview of the state of the art in neural networks, the main building block of the brain. Computer scientists typically focus on artificial neural …
Neural network definitions
From the structural point of view, a brain consists of a network of neurons densely interconnected via synapses, a structure that artificial neural networks attempt to recreate. In this section we present different neuron and network models and discuss information and feature encoding in these networks.
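The two neuron families contrasted in this review can be sketched in a few lines of code. The following is a minimal illustration of our own (not from the paper): a classic artificial rate-based neuron alongside one Euler step of a leaky integrate-and-fire spiking neuron; all parameter values are arbitrary placeholders.

```python
import math

def artificial_neuron(inputs, weights, bias):
    """Artificial (rate-based) neuron: a weighted sum of the inputs
    passed through a sigmoid activation function."""
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def lif_step(v, current, dt=1.0, tau=10.0, v_rest=0.0, v_thresh=1.0):
    """One Euler step of a leaky integrate-and-fire spiking neuron.
    Returns the updated membrane potential and True if it spiked."""
    v = v + (dt / tau) * (-(v - v_rest) + current)
    if v >= v_thresh:
        return v_rest, True   # spike, then reset to the resting potential
    return v, False
```

The key distinction the section develops is visible here: the artificial neuron communicates a continuous activation value, whereas the spiking neuron communicates discrete spike events unfolding over time.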
Establishing model parameters
A neural network is a meta-model for computation. Applying it to solve a concrete problem requires a particular set of parameters, whose tuning and life cycle are presented in the following subsections.
Learning
Learning is a data-driven mechanism for obtaining model parameters. In this section, we describe the learning process and classify learning algorithms, with a focus on a number of notable learning trends.
Machine learning techniques aim to shift the work burden from human effort to machine computation for specific classes of problems. With the right algorithm, the manual effort of designing solutions can be traded for memory and computational power. Algorithms, in the …
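As a concrete, hypothetical illustration of this trade (our own sketch, not from the paper), the snippet below learns the logical OR function from examples by gradient descent on a single sigmoid neuron instead of hand-coding the rule; the learning rate and epoch count are arbitrary choices.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(data, epochs=2000, lr=0.5):
    """Fit the weights of one sigmoid neuron by gradient descent:
    the model parameters are obtained from data, not designed by hand."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in data:
            y = sigmoid(w[0] * x[0] + w[1] * x[1] + b)
            grad = y - target  # gradient of the cross-entropy loss w.r.t. z
            w = [wi - lr * grad * xi for wi, xi in zip(w, x)]
            b -= lr * grad
    return w, b

# Examples of logical OR: the design effort is replaced by data.
data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
w, b = train(data)
predictions = [round(sigmoid(w[0] * x[0] + w[1] * x[1] + b)) for x, _ in data]
```

The human specifies only the model family and the examples; the computation spent in the training loop discovers weights that reproduce the target function.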
Advancing neural networks research
A number of recent advances have helped neural networks regain mainstream attention. Firstly, novel hardware has made it possible to verify concepts at a much larger scale, e.g. by simulating a system with 10^9 neurons [2], comparable in scale to a cat's brain. Secondly, the unsupervised feature learning paradigm offered an approach that could exploit large, unpreprocessed datasets, yielding results such as a self-emerging cat-image detector [43]. Having already discussed the latter in …
Summary
In this paper, we have provided a high-level overview of current trends in neural network research. Starting from the definitions of neural network models, we compare the artificial neuron model with the biologically plausible spiking neuron. Furthermore, we explain the basic concepts behind the encoding and decoding of information in a neural network model, be it an artificial mathematical abstraction or a biological neuroscience model. Moreover, we present the model parameters that need …
References (68)
- et al., A dynamic spatial gradient of Hebbian learning in dendrites, Neuron (2006)
- et al., Error-backpropagation in temporally encoded networks of spiking neurons, Neurocomputing (2002)
- et al., Synchronous oscillatory neural ensembles for rules in the prefrontal cortex, Neuron (2012)
- et al., Chaotic simulated annealing by a neural network model with transient chaos, Neural Netw. (1995)
- et al., Multi-column deep neural network for traffic sign classification, Neural Netw. (2012)
- et al., Face processing using one spike per neurone, Biosystems (1998)
- et al., The impulses produced by sensory nerve-endings. Part II: the response of a single end-organ, J. Physiol. (1926)
- R. Ananthanarayanan, S.K. Esser, H.D. Simon, D.S. Modha, The cat is out of the bag, In: Proceedings of the Conference...
- et al., Information theory and neural coding, Nat. Neurosci. (1999)
- et al., A comprehensive workflow for general-purpose neural modeling with highly configurable neuromorphic hardware systems, Biol. Cybern. (2011)
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, IEEE Trans. Audio Speech Lang. Process.
- Theoretical neuroscience: computational and mathematical modeling of neural systems, J. Cognit. Neurosci.
- Axon physiology, Physiol. Rev.
- How to Build a Brain: A Neural Architecture for Biological Cognition
- Neural Engineering: Computation, Representation, and Dynamics in Neurobiological Systems
- A large-scale model of the functioning brain, Science
- Designing neural networks using gene expression programming, Appl. Soft Comput. Technol.: Chall. Complex.
- Building Watson: an overview of the DeepQA project, AI Mag.
- Functional requirements for reward-modulated spike-timing-dependent plasticity, J. Neurosci.
- Combining ELMs with random projections, IEEE Intell. Syst.
- A neuronal learning rule for sub-millisecond temporal coding, Nature
- Spiking Neuron Models: Single Neurons, Populations, Plasticity
- How good are neuron models?, Science
- Theory and simulation in neuroscience, Science
- A novel connectionist system for unconstrained handwriting recognition, IEEE Trans. Pattern Anal. Mach. Intell.
- Evolutionary neural networks for anomaly detection based on the behavior of a program, IEEE Trans. Syst. Man Cybern. Part B: Cybern.
- ModelDB: a database to support computational neuroscience, J. Comput. Neurosci.
- A fast learning algorithm for deep belief nets, Neural Comput.
Adela-Diana Almási obtained her Bachelor's and Master's degrees in Computer Science from the "Politehnica" University of Bucharest, Romania, where she is currently a PhD student under the supervision of Prof. Valentin Cristea. She is conducting her doctoral research in collaboration with the IBM Research Laboratory in Zurich, Switzerland. Her main research interests are machine learning, neural networks and data analytics.
Stanisław Woźniak received his BSc and MSc degrees in Computer Science from Poznan University of Technology, Poland. Currently he is pursuing a PhD degree at École Polytechnique Fédérale de Lausanne, Switzerland, in collaboration with IBM Research – Zurich, Switzerland. His research interests include neural networks, machine learning, computational neuroscience and cognitive systems.
Valentin Cristea is the Head of the Computer Science and Engineering Department and a Professor at the Politehnica University of Bucharest. His main fields of expertise are computer networks, e-services, and large-scale distributed systems, topics on which he teaches courses and supervises PhD students. He is the Director of the National Center for Information Technology and leader of the CoLaborator, Distributed Systems and Grid, and e-Business/e-Government laboratories. He has long experience in the development and management of international and national research projects on e-services, dependable large-scale distributed systems, Grid and Cloud computing, and smart environments. He received the IBM Faculty Award in 2003 and 2011, and he is an IEEE and ACM member and a Phare IT expert.
Yusuf Leblebici received the B.Sc. and M.Sc. degrees in electrical engineering from Istanbul Technical University, Istanbul, Turkey, in 1984 and 1986, respectively, and the Ph.D. degree in electrical and computer engineering from the University of Illinois at Urbana-Champaign (UIUC) in 1990. Since 2002, he has been a Chair Professor at the Swiss Federal Institute of Technology in Lausanne (EPFL) and director of the Microelectronic Systems Laboratory. His research interests include the design of high-speed CMOS digital and mixed-signal integrated circuits, computer-aided design of VLSI systems, intelligent sensor interfaces, modeling and simulation of semiconductor devices, and VLSI reliability analysis. He is the coauthor of six textbooks, as well as more than 300 articles published in various journals and conferences. He has been an IEEE Fellow since 2010, and he was elected Distinguished Lecturer of the IEEE Circuits and Systems Society for 2010–2011.
Ton Engbersen has been with IBM Research since 1980. His career has spanned areas as diverse as image processing, chip design, communications technology, server technology, legacy management, innovation in outsourcing, and data-center energy management. Over the years he has held a range of R&D management positions in Switzerland and in the US. As a member of the IBM Academy of Technology, he led its European branch from 2009 to 2011. He has published more than 50 articles in refereed scientific journals. Currently he is the Scientific Director for the ASTRON-IBM Center of Exascale Technology, leading the DOME Project (http://www-03.ibm.com/press/us/en/pressrelease/37361.wss).