American Journal of Software Engineering and Applications

| Peer-Reviewed |

R Language in Data Mining Techniques and Statistics

Received: 11 February 2013    Accepted:     Published: 20 February 2013
Views:       Downloads:

Share This Article

Abstract

Data mining is a set of techniques and methods relating to the extraction of knowledge from large amounts of data (through automatic or semi-automatic methods) and further scientific, industrial or operational use of that knowledge. Data mining is closely related to the statistics as an applied mathematical discipline with an analysis of data that could be defined as the extraction of useful information from data.The only difference between the two disciplines is that data mining is a new discipline that is related to significant or large data sets. R is an object-oriented programming language. This means that everything what is done with R can be saved as an object. Every object has a class. It describes what the object contains and what each function does. Application of R as a programming language and statistical software is much more than a supplement to Stata, SAS, and SPSS. Although it is more difficult to learn, the biggest advantage of R is its free-of-charge feature and the wealth of specialized application packages and libraries for a huge number of statistical, mathematical and other methods. R is a simple, but very powerful data mining and statistical data processing tool and once "discovered", it provides users with an entirely new, rich and powerful tool applicable in almost every field of research

DOI 10.11648/j.ajsea.20130201.12
Published in American Journal of Software Engineering and Applications (Volume 2, Issue 1, February 2013)
Page(s) 7-12
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2024. Published by Science Publishing Group

Keywords

R Language, Data Mining Techniques, Statistics

References
[1] M. J. Crawley, The R book, Imperial College London at Silwood Park, UK, John Wiley and Sons, Ltd 2007.
[2] Dalgaard, P. Introductory Statistics with R, New York, Springer-Verlag, 2002.
[3] Krause, A. and Olson, M. The Basics of S and S-PLUS, New York, Springer-Verlag, 2000.
[4] McCulloch, C.E. and Searle, S.R. Generalized, Linear and Mixed Models, New York: John, 2001.
Author Information
  • Montenegro Business School, "Mediterranean" University, Montenegro; Dipartimento di Informatica, Università degli Studi di Bari "Aldo Moro", Italy

Cite This Article
  • APA Style

    Sonja Pravilovic. (2013). R Language in Data Mining Techniques and Statistics. American Journal of Software Engineering and Applications, 2(1), 7-12. https://doi.org/10.11648/j.ajsea.20130201.12

    Copy | Download

    ACS Style

    Sonja Pravilovic. R Language in Data Mining Techniques and Statistics. Am. J. Softw. Eng. Appl. 2013, 2(1), 7-12. doi: 10.11648/j.ajsea.20130201.12

    Copy | Download

    AMA Style

    Sonja Pravilovic. R Language in Data Mining Techniques and Statistics. Am J Softw Eng Appl. 2013;2(1):7-12. doi: 10.11648/j.ajsea.20130201.12

    Copy | Download

  • @article{10.11648/j.ajsea.20130201.12,
      author = {Sonja Pravilovic},
      title = {R Language in Data Mining Techniques and Statistics},
      journal = {American Journal of Software Engineering and Applications},
      volume = {2},
      number = {1},
      pages = {7-12},
      doi = {10.11648/j.ajsea.20130201.12},
      url = {https://doi.org/10.11648/j.ajsea.20130201.12},
      eprint = {https://download.sciencepg.com/pdf/10.11648.j.ajsea.20130201.12},
      abstract = {Data mining is a set of techniques and methods relating to the extraction of knowledge from large amounts of data (through automatic or semi-automatic methods) and further scientific, industrial or operational use of that knowledge. Data mining is closely related to the statistics as an applied mathematical discipline with an analysis of data that could be defined as the extraction of useful information from data.The only difference between the two disciplines is that data mining is a new discipline that is related to significant or large data sets. R is an object-oriented programming language. This means that everything what is done with R can be saved as an object. Every object has a class. It describes what the object contains and what each function does. Application of R as a programming language and statistical software is much more than a supplement to Stata, SAS, and SPSS. Although it is more difficult to learn, the biggest advantage of R is its free-of-charge feature and the wealth of specialized application packages and libraries for a huge number of statistical, mathematical and other methods. R is a simple, but very powerful data mining and statistical data processing tool and once "discovered", it provides users with an entirely new, rich and powerful tool applicable in almost every field of research},
     year = {2013}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - R Language in Data Mining Techniques and Statistics
    AU  - Sonja Pravilovic
    Y1  - 2013/02/20
    PY  - 2013
    N1  - https://doi.org/10.11648/j.ajsea.20130201.12
    DO  - 10.11648/j.ajsea.20130201.12
    T2  - American Journal of Software Engineering and Applications
    JF  - American Journal of Software Engineering and Applications
    JO  - American Journal of Software Engineering and Applications
    SP  - 7
    EP  - 12
    PB  - Science Publishing Group
    SN  - 2327-249X
    UR  - https://doi.org/10.11648/j.ajsea.20130201.12
    AB  - Data mining is a set of techniques and methods relating to the extraction of knowledge from large amounts of data (through automatic or semi-automatic methods) and further scientific, industrial or operational use of that knowledge. Data mining is closely related to the statistics as an applied mathematical discipline with an analysis of data that could be defined as the extraction of useful information from data.The only difference between the two disciplines is that data mining is a new discipline that is related to significant or large data sets. R is an object-oriented programming language. This means that everything what is done with R can be saved as an object. Every object has a class. It describes what the object contains and what each function does. Application of R as a programming language and statistical software is much more than a supplement to Stata, SAS, and SPSS. Although it is more difficult to learn, the biggest advantage of R is its free-of-charge feature and the wealth of specialized application packages and libraries for a huge number of statistical, mathematical and other methods. R is a simple, but very powerful data mining and statistical data processing tool and once "discovered", it provides users with an entirely new, rich and powerful tool applicable in almost every field of research
    VL  - 2
    IS  - 1
    ER  - 

    Copy | Download

  • Sections