
Santiago Israel Logroño Naranjo, Néstor Augusto Estrada Brito, Vanessa Alexandra Vásconez Núñez*.
Evelin Marisol Rosero Ordóñez
Espirales. Revista multidisciplinaria de investigación científica, Vol. 6, No. 41
April – June - 2022. e-ISSN 2550-6862. pp 1-13
For (Matsuda et al., 2021) data cleaning is performed after data processing and
organization. It involves searching for duplicates, errors, and inconsistencies in the data,
and subsequently eliminating them. The data cleaning process includes tasks such as
sorting data, matching records, identifying outliers, identifying inaccurate data,
maintaining data quality, and spell-checking textual data. As a result, it helps to avoid
unexpected results in delivering high quality data, which is important for a robust and
successful outcome.
Once the data sets are cleaned and free of errors, the analysis can proceed. Then,
different modes of techniques are applied such as, preliminary analysis of the data by
understanding the messages contained in the data obtained, and expressive statistics -
finding the mode, median, mean and others for Meng et al., (2021) data visualization is
one of the techniques used, in which the data are illustrated in a graphical format to
obtain additional observation regarding the information in the data.
Mathematical models or formulas (called algorithms), were added to the data in order
to discover relationships within variables; such as, the use of causality/correlation.
Results
Python is easy to master and learn. Most people can learn it, even those with less
programming knowledge. By using a popular language, there are many possibilities to
find a solution to problems that may arise. Watcharasupat et al., (2022) Writing code in
Python is easy, which enhances development. Also, Python can be accessed by design,
making it one of the fastest languages in terms of development speed. In addition,
reading Python code is intuitive, making it easy to maintain. Python's syntax is concise
and clear. The layout of the language is fairly close to English and readable, making it
easy to interpret.
Fewer lines of code are needed for Python to get results compared to other languages
such as Java or C. This simplicity of Python helps a lot when reading written code or
another developer's code. Code review is much faster and easier when there is less line
code to review, and the reading is more like English. There is less updating that is
involved each time code changes hands, the calculation is done quickly as far as each
function is concerned. With code that makes it easy to understand and navigate, the
user may be able to minimize the amount of work it takes to extend and maintain their
code base. Likewise, Python provides tried and tested versatility. Likewise, Python is
utilized by some assertive projects around the web like, Reddit, EVE Online, YouTube
to reliably serve the base to their users.
In terms of statistical analysis in Python, the path explained above is followed, but
through a practical example presented below: data collection in Python was done
through the creation of behavioral experiments or electronic surveys with flexibility in
how they presented audio/visual stimuli (e.g., text, shapes, sounds, images, movies,
animations), recording simple timing measurements (e.g., stimulus durations or onsets),