UDK 519.87
O THE PROBLEM OF GENERATION SAMPLE IN SOLVING THE PROBLEM OF NON INTERIAL PROCESSES IDENTIFICATION
E. A. Chzhan
Siberian Federal University 79/10, Svobodny Av., Krasnoyarsk, 660041, Russian Federation Е-mail: ekach@list.ru
The problem of improving the quality of the original data in the identification of H-processes is considered. There is stochastic dependence between input variables of such processes so this process precedes not at all the regulated area but only at some of its subdomain. This fact leads to some features that must be considered when identifying. If there is large number of a priori data when building models, you can use the methods of identification in the “broad” sense. However, if there is sufficient a priori information about the object being studied, it is necessary to apply the methods of identification in the “narrow” sense. These methods include nonparametric estimation of regression function from observations. The quality of solving the problem of identification depends on the quality of input data. It is advisable to conduct a preliminary analysis of data to identify and address all the deficiencies in the sample. Under the preliminary analysis of the data is taken to mean filling gaps in observations and eliminating emissions. However, the sample may have other flaws, that will be discussed below, which adversely affect the accuracy of estimation, and, in some cases, lead to the fact that the resulting model will be inadequate to the investigated process. If the point of the original sample in the field of process located patchy, there are low-pressure range and lack of observations, in the areas of reconstruction accuracy is low. Due to the properties of nonparametric estimation, which belongs to the class of local approximations, projections can not be given at the lack of observations subdomain. To resolve all these shortcomings we propose an algorithm to obtain a working sample by generating new points in regions where the density is low in comparison with other areas. After generating new working sample, the quality of estimation is significantly improved, as evidenced by the results of numerical experiments. This kind of tasks are relevant and can be used in solving the problem of recognition in various fields, where the classification accuracy is important, for example, in the diagnosis of space propulsion, electrical, etc.
identification, noninterial processes H-process, sample, data analysis, non-parametric modeling.
References
  1. Eykhoff P. Osnovy identifikatsii sistem upravleniya [System Identification Parameter and State Estimation]. Moscow, Mir Publ., 1975, 683 p.
  2. Tsypkin Ya. Z. Osnovy informatsionnoy teorii identifikatsii [The foundation of information identification theory]. Moscow, Nauka Publ., 1984, 320 p.
  3. Medvedev A. V. [Some remarks on the H-model inertialess processes with delay]. Vestnik SibGAU. 2014, No. 2 (54), P. 50–55 (In Russ.).
  4. Korneeva A. A., Sergeev A. N., Chzhan E. A. [Investigation of nonparametric models of tubular type processes]. Vestnik SibGAU. 2012, No. 5 (45), P. 44–49 (In Russ.).
  5. Medvedev A. V. Neparametricheskie sistemy adaptatsii [Nonparametric adaptation systems]. Novosibirsk, Nauka Publ., 1983, 174 p.
  6. Nadaraya E. A. Neparametricheskie otsenki plotnosti veroyatnosti i krivoy regresii [Non-parametric estimation of the probability density and the regression curve]. Tbilisi, Tbilisi University Publ., 1983, 194 p.
  7. Khardle V. Prikladnaya neparametricheskaya regressiya [Applied Nonparametric Regression]. Moscow, Mir Publ., 1993, 352 p.
  8. Zagoruyko N. G. Prikladnye metody analiza dannykh i znaniy [Applied methods of data and knowledge analysis]. Novosibirsk, IM SO RAN Publ., 1999. 264 p.
  9. Gelman A. et al. Bayesian data analysis. CRC press, 2013.
  10. Kantardzic M. Data mining: concepts, models, methods, and algorithms. John Wiley & Sons, 2011.
  11. Fisher C. W., Kingma B. R. Criticality of data quality as exemplified in two disasters. Information & Management. 2001, Vol. 39, No 2, P. 109–116.
  12. Gillies D., Thornley D., Bisdikian C. Probabilistic approaches to estimating the quality of information in military sensor networks. The Computer Journal. 2010, Vol. 53, No 5, P. 493–502.
  13. Vaganov M. A., Moskalets O. D., Kulakov S. V. [Multichannel spectral devices for the diagnosis of liquid rocket engines]. Informatsionno-upravlyayushchie sistemy, No 1 (62), P. 2–6 (In Russ.).
  14. Mukhin S. V., Rebenkov A. V. [Prospects for the development of information-measuring and control systems for liquid rocket engine test stand at the chemical plant-branch of “Krasmash”]. Reshetnevskie chteniya : materialy XIV Mezhdunar. nauch. konf. [Reshetnev Readings: materials XIV Intern. scientific. Conf.]. Krasnoyarsk, 10-12, November 2010, Part 1. P. 261–266 (In Russ.).
  15. Orlov V. I., Sergeeva N. A. [Nonparametric diagnosis and control of the process of making electronic component]. Vestnik SibGAU. 2013, No. 2 (48), P. 70–75 (In Russ.).

Chzhan Ekaterina Anatol’evna – postgraduate student, Siberian Federal University. E-mail: ekach@list.ru