3
Cristian Morales Alarcón, Ciro Radicelli García, Margarita Pomboza Floril
Espirales. Revista multidisciplinaria de investigación cientíca, vol 5, No. 36
Enero-marzo 2021. e-ISSN 2550-6862. págs 1-16
In addition, the number of citations that scientic articles published in a specic journal have
at a given time is considered, so in order to measure said quality, there are entities such as
the Institute for Scientic Information (ISI), attached to the company Thomson Reuters, that
uses the Journal Citation Report (JCR), which is nothing more than a database that presents
detailed gures about publications and their citations. However, in addition to the JCR index,
there are other databases that measure the quality of published documents, such as Scopus,
which is attached to Elsevier, and which is mainly run by the SCImago research group of
Spain (Valderrama, 2012), which use the SCImago Journal and Country Rank (SJR) and the
SCImago Institutions Ranking (SIR) as indicators.
The volumes of data that are stored in databases, allow a complete processing of the
information, for which it works in phases such as pre-processing, data mining itself and the
post-processing of said information. In this sense, to facilitate the retrieval and delivery of
information carried out by personnel who work with large volumes of data, such as librarians,
the horizons have been opened towards other professions that are called to cooperate, thus
we now have designers systems, data providers, publishers, vendors, archivists, engineers
and specialists in electronic text encoding, among others; whose opinions and experiences
will allow the development of adequate interfaces to facilitate the location, manipulation,
retrieval and use of digital information.
In reference to the aforementioned, Valcárcel (2004) mentions that the “minería de datos” (or
commonly called Data Mining), refers to the process of extracting knowledge from databases,
with the aim of discovering anomalous and/or interesting situations, as well as trends, patterns
and sequences in the data. For their part, Molina and Ribiero (2001) clarify that mining is the
integration of a set of areas whose purpose is to identify knowledge obtained from databases
that provide a bias towards decision-making. Likewise, Molina (2002) indicates that data mining
is a non-trivial process of valid, novel, potentially useful and understandable identication of
understandable patterns that are hidden in the data.
Thus, for this purpose, new tools have been created in order to facilitate access to the
accumulation of information that is generated daily, one of the most used being text mining,
which offers the possibility of exploring large amounts of non-organized texts, in addition
to establishing patterns and extracting useful knowledge. Text mining then refers to the
examination of a collection of documents in order to discover information that is not explicit
in the analyzed text (Nasukawa, Kawano, & Arimura, 2001).
The importance of text mining lies in the effectiveness of its predictive models, which have
saved time and money; as well as the improvement of the capacity to respond to the needs
of the interested parties, it is thus that the use of computer tools used for the discovery and
processing of information will improve the knowledge management process.