International Center UNESCO for Information Technologies of NASU  
Description
Algorithms
Software
Examples
History
Articles
   
Welcome to the Group Method of Data Handling site!

 








 

 






eXTReMe Tracker

Group Method of Data Handling* was applied in a great variety of areas for data mining and knowledge discovery, forecasting and systems modeling, optimization and pattern recognition. Inductive GMDH algorithms give possibility to find automatically interrelations in data, to select optimal structure of model or network and to increase the accuracy of existing algorithms.

Criterion Characteristic This original self-organizing approach is substantially different from deductive methods used commonly for modeling. It has inductive nature - it finds the best solution by sorting-out of possible variants.

By sorting of different solutions GMDH networks aims to minimize the influence of the author on the results of modeling. Computer itself finds the structure of the model and the laws which act in the system.

Group Method of Data Handling is a set of several algorithms for different problems solution. It consists of parametric, clusterization, analogues complexing, rebinarization and probability algorithms. This inductive approach is based on sorting-out of gradually complicated models and selection of the best solution by minimum of external criterion characteristic. Not only polynomials but also non-linear, probabilistic functions or clusterizations are used as basic models.

GMDH approach can be useful because:

  • The optimal complexity of model structure is found, adequate to level of noise in data sample. For real problems solution with noised or short data, simplified forecasting models are more accurate.
  • The number of layers and neurons in hidden layers, model structure and other optimal NN parameters are determined automatically.
  • It guarantees that the most accurate or unbiased models will be found - method doesn't miss the best solution during sorting of all variants (in given class of functions).
  • As input variables are used any non-linear functions or features, which can influence the output variable.
  • It automatically finds interpretable relationships in data and selects effective input variables.
  • GMDH sorting algorithms are rather simple for programming.
  • TMNN neural nets are used to increase the accuracy of another modelling algorithms.
  • Method uses information directly from data sample and minimizes influence of apriori author assumptions about results of modeling.
  • Approach gives possibility to find unbiased physical model of object (law or clusterization) - one and the same for future samples.
Since 1968 many investigations and applications of GMDH were conducted in many countries. It was implemented in many commercial software products.

* GMDH is known also as Polynomial Neural Networks, Abductive and Statistical Learning Networks
GMDH News
  • The new website OpenGMDH use wiki technology to develop GMDH software, propose to exchange articles and discuss inductive methods


    Publications
  • Proceedings of the IWIM'2007, devoted to the GMDH approach are added
  • Biographic materials devoted to 90 anniversary of Alexey Ivakhnenko, the author of GMDH were added

    Software
  • A Mathematica program for the implementation of GMDH algorithm was proposed
  • GMDH PNN algorithm for on-line computation

    Books
  • The three books in russian and the book of Madala H.R., Ivakhnenko A.G. 'Inductive Learning Algorithms for Complex Systems Modeling.' can be downloaded
  • Book of Mueller J.A., Lemke F. 'Self-Organising Data Mining. An Intelligent Approach To Extract Knowledge From Data' can be purchased and downloaded

    Tutorials
  • Main ideas are described in PowerPoint presentation