<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article
  PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.0/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.0" specific-use="sps-1.8" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">rfing</journal-id>
			<journal-title-group>
				<journal-title>Revista Facultad de Ingeniería</journal-title>
				<abbrev-journal-title abbrev-type="publisher">Rev. Fac. ing.</abbrev-journal-title>
			</journal-title-group>
			<issn pub-type="ppub">0121-1129</issn>
			<issn pub-type="epub">2357-5328</issn>
			<publisher>
				<publisher-name>Universidad Pedagógica y Tecnológica de Colombia</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="doi">10.19053/01211129.v33.n70.2024.18340</article-id>
			<article-id pub-id-type="publisher-id">00004</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Article</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>EVALUATION OF FAIR PRINCIPLES IN RESEARCH DATA REPOSITORIES AT COLOMBIAN UNIVERSITIES</article-title>
				<trans-title-group xml:lang="es">
					<trans-title>Evaluación de principios FAIR en repositorios de datos de investigación en instituciones de educación superior en Colombia</trans-title>
				</trans-title-group>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0009-0004-1907-1965</contrib-id>
					<name>
						<surname>Lopez-Hoyos</surname>
						<given-names>Gineth-Andrea</given-names>
					</name>
					<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-2271-6101</contrib-id>
					<name>
						<surname>Roa-Martínez</surname>
						<given-names>Sandra-Milena</given-names>
					</name>
					<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
				</contrib>
			</contrib-group>
			<aff id="aff1">
				<label>1</label>
				<institution content-type="original"> Universidad del Cauca (Popayán-Colombia). lopezhoyos@unicauca.edu.co, https://orcid.org/0009-0004-1907-1965 </institution>
				<institution content-type="normalized">Universidad del Cauca</institution>
				<institution content-type="orgname">Universidad del Cauca</institution>
				<addr-line>
					<named-content content-type="city">Popayán</named-content>
				</addr-line>
				<country country="CO">Colombia</country>
				<email>lopezhoyos@unicauca.edu.co</email>
			</aff>
			<aff id="aff2">
				<label>2</label>
				<institution content-type="original"> Universidad del Cauca (Popayán-Colombia). smroa@unicauca.edu.co, https://orcid.org/0000-0002-2271-6101 </institution>
				<institution content-type="normalized">Universidad del Cauca</institution>
				<institution content-type="orgname">Universidad del Cauca</institution>
				<addr-line>
					<named-content content-type="city">Popayán</named-content>
				</addr-line>
				<country country="CO">Colombia</country>
				<email>smroa@unicauca.edu.co</email>
			</aff>
			<!--<pub-date date-type="pub" publication-format="electronic">
				<day>19</day>
				<month>12</month>
				<year>2024</year>
			</pub-date>
			<pub-date date-type="collection" publication-format="electronic">
				<season></season>
				<year></year>
			</pub-date>-->
			<pub-date pub-type="epub-ppub">
				<season>Oct-Dec</season>
				<year>2024</year>
			</pub-date>
			<volume>33</volume>
			<issue>70</issue>
			<elocation-id>e18340</elocation-id>
			<history>
				<date date-type="received">
					<day>18</day>
					<month>08</month>
					<year>2024</year>
				</date>
				<date date-type="accepted">
					<day>19</day>
					<month>11</month>
					<year>2024</year>
				</date>
			</history>
			<permissions>
				<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/" xml:lang="en">
					<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution License</license-p>
				</license>
			</permissions>
			<abstract>
				<title>ABSTRACT</title>
				<p>The volume of data generated in research processes represents an opportunity for collaborative work, scientific advancement, and integration into the open science movement, made possible by the reuse of data for purposes such as reproducibility. The main goal of this work was to identify the landscape of research data repositories (RDRs) as a strategy for sharing or publishing research data from higher education institutions (HEIs) in Colombia. To this end, the RDRs of Colombian universities listed in the SCImago Journal Rank and in the Papyrus meta-repository were analyzed, along with their respective datasets. Subsequently, the implementation of the FAIR principles (Findable, Accessible, Interoperable, Reusable) in these repositories was evaluated. It was found that, although institutional repositories exist, there are few RDRs. However, initiatives to share datasets are evident when evaluating the FAIR principles, as indicated by the level of completeness found in these HEIs. It is concluded that initiatives like Papyrus serve as platforms for indexing RDRs in Colombia and making them visible, with significant contributions from HEIs, and that the implementation of the FAIR principles promotes the integration and sharing of data for open science.</p>
			</abstract>
			<trans-abstract xml:lang="es">
				<title>RESUMEN</title>
				<p>El volumen de datos generados en los procesos de investigación se constituye como una posibilidad de trabajo colaborativo, avance de la ciencia e integración al movimiento de ciencia abierta, lo cual se viabiliza a partir del reúso con fines de reproducibilidad que se puede dar a estos datos, entre otros. El objetivo de este trabajo fue identificar el panorama de los repositorios de datos de investigación (RDI) como estrategia de compartición o disponibilización de datos de investigación de las instituciones de educación superior (IES) en Colombia. Para ello, se analizó la existencia de RDI en universidades colombianas encontradas en el SCImago Journal Rank y en el meta-repositorio Papyrus y sus conjuntos de datos. Posteriormente, se evaluó la implementación de los principios FAIR (Findable, Accessible, Interoperable, Reusable) en dichos repositorios. Se encontró que, aunque existen repositorios institucionales, son pocos los RDI, no obstante, son claras las iniciativas de compartición de conjuntos de datos que se evidencian al evaluar los principios FAIR, debido al nivel de completitud encontrado en dichas IES. Se concluye que iniciativas como Papyrus en Colombia se presentan como plataformas de indexación y visibilidad de los RDI en Colombia, con alta contribución de las IES y la implementación de los principios FAIR favorecen la integración y compartición de datos para la ciencia abierta.</p>
			</trans-abstract>
			<kwd-group xml:lang="en">
				<title>Keywords:</title>
				<kwd>Access to information</kwd>
				<kwd>dataset</kwd>
				<kwd>open science</kwd>
			</kwd-group>
			<kwd-group xml:lang="es">
				<title>Palabras clave:</title>
				<kwd>Acceso a la información</kwd>
				<kwd>ciencia abierta</kwd>
				<kwd>conjunto de datos</kwd>
			</kwd-group>
			<counts>
				<fig-count count="2"/>
				<table-count count="5"/>
				<equation-count count="0"/>
				<ref-count count="13"/>
				<page-count count="0"/>
			</counts>
		</article-meta>
	</front>
	<body>
		<sec sec-type="intro">
			<title>1. INTRODUCTION</title>
			<p>Universities in Colombia generate a significant amount of high-value intellectual output for the scientific community (books, journals, articles, theses, and datasets, among others), much of which may not be found or reused by other members of society. As a result, efforts in similar areas may be duplicated because researchers cannot take into account the data generated by others, whether at the conceptual or practical level. Hence, there is a need for universities, particularly those that generate scientific knowledge, to share and publish this data. These practices also contribute to the research community and allow for the reproducibility and reuse of data, which are pillars of open science <xref ref-type="bibr" rid="B1">[1]</xref>.</p>
			<p>Access to the information produced in research is one of the main challenges researchers face. This highlights the need for universities across the country to implement digital ecosystems that foster collaboration among researchers and the reuse of data, leading to structured advancements in the construction of science. Accordingly, the objective of this work was to identify the national landscape regarding the sharing or publication of research data, in the context of the growing volume of data and the open science movement, through a review of research data repositories in higher education institutions in Colombia. It also sought to determine whether the data repositories found make their datasets findable, accessible, interoperable, and reusable, that is, whether they incorporate the FAIR principles (Findable, Accessible, Interoperable, Reusable), with the aim of enhancing and encouraging practices of reuse, collaboration, and scientific advancement.</p>
			<p>Effective metadata usage is essential to help organize and describe the information generated through research in universities, much of which remains unused due to a lack of access <xref ref-type="bibr" rid="B2">[2]</xref>. Metadata is the primary means of implementing the FAIR principles.</p>
			<p>Research data are the raw data on which any research is based. They may or may not be published when scientific progress is communicated, but they underpin new knowledge <xref ref-type="bibr" rid="B3">[3]</xref>. When this data is properly described and assigned metadata for subsequent retrieval, it is deposited in data repositories <xref ref-type="bibr" rid="B4">[4]</xref>. In contrast, open data is public information made available in formats that allow its use and reuse under an open license and without legal restrictions on its exploitation <xref ref-type="bibr" rid="B1">[1]</xref>, with the aim of generating value in processes of active transparency, accountability, citizen participation, research, and innovation. Thus, data repositories are associated with the idea of valuing data as an organizational asset, while evaluating the ability to manage it and establishing responsibilities in decision-making and related tasks, ensuring data quality and proper use <xref ref-type="bibr" rid="B5">[5]</xref>.</p>
			<p>A data repository can be defined as a system and set of services designed as an archive for digital data with context, stability, and persistence <xref ref-type="bibr" rid="B6">[6]</xref>, or as a database infrastructure that collects, manages, and provides access to data, metadata, and associated documentation <xref ref-type="bibr" rid="B4">[4]</xref>. The material produced during research is used to extract and validate results, promoting the idea that raw data can have a &quot;second life&quot; and be used beyond its initial purpose. Such uses range from reusing data to produce new studies to serving as a means of verifying research results <xref ref-type="bibr" rid="B7">[7]</xref>.</p>
			<p>In addition, through Resolution 460 of 2022 <xref ref-type="bibr" rid="B8">[8]</xref>, the Ministry of Information and Communication Technologies (MinTIC Colombia) issued the National Data Infrastructure Plan (PNID) and its roadmap to drive the digital transformation of the State and the development of a data-based economy.</p>
			<p>Some meta-repositories, such as R3Data, Datos.gov, and Papyrus, collect information from associated entities; they are promoted by the national government and align with the National Data Infrastructure Plan. R3Data is a global registry of research data repositories across various academic disciplines, promoting a culture of exchange, greater access, and better visibility of research data; it currently lists 3,244 repositories.</p>
			<p>Datos.gov is Colombia's national open data platform, which promotes transparency and decision-making based on public data. Meanwhile, Papyrus is a multidisciplinary repository housing scientific datasets from research projects of institutions within the Consortia Consortium, with the aim of meeting international standards and managing persistent identifiers such as the Digital Object Identifier (DOI).</p>
			<p>According to the FAIR principles guide for scientific data management and administration, the following definitions are provided <xref ref-type="bibr" rid="B9">[9]</xref>: 1) <bold>Findable:</bold> Data and metadata should be discoverable by the community after publication, through search tools. 2) <bold>Accessible:</bold> Data and metadata should be accessible, allowing other researchers to download them using their identifiers. 3) <bold>Interoperable:</bold> Both data and metadata should be described according to community rules, using open standards, to enable their exchange and reuse. 4) <bold>Reusable:</bold> Data and metadata should be reusable by other researchers, with clear provenance and reuse conditions.</p>
		</sec>
		<sec>
			<title>2. RELATED WORK</title>
			<p>Osorio et al. <xref ref-type="bibr" rid="B1">[1]</xref> analyze open data in Colombian higher education institutions (HEIs) and reveal that the publication of data in open formats by HEIs is low, with only 5.4% of the sampled HEIs having published at least one dataset on the national open data portal. This highlights the need to encourage openness and the use of data in HEIs by establishing a data governance framework that facilitates inter-institutional cooperation leading to knowledge generation and innovation.</p>
			<p>Although the government has made efforts to develop open data dissemination strategies, that study confirms the low adoption of these strategies in public and private entities. It also found that further research is needed to design tools (for evaluation, diagnosis, and gap identification) that support the implementation of open data initiatives and boost inter-institutional collaboration in generating knowledge, products, and services for society.</p>
			<p>Méndez et al. <xref ref-type="bibr" rid="B10">[10]</xref> reviewed the status of research data repository implementation in Spanish public universities. The first step involved selecting the universities for the study sample. The results show the effort being made to publish datasets (1,955), with 10 universities accounting for 68.3% of the published data, and an increased interest in dataset publication since 2021. The study also documents the technologies used to implement research data repositories, identifying that DSpace and Dataverse are widely used, while other platforms such as EPrints, InvenioRDM, and Zenodo are implemented in some of the universities studied. An evaluation of these platforms against the FAIR principles showed that Dataverse satisfies the most FAIR principles (9 fully and 4 partially), while DSpace (6 fully and 5 partially) is the most widely installed. The study concludes that the implementation of research data repositories is at an early stage.</p>
		</sec>
		<sec sec-type="methods">
			<title>3. METHODOLOGY</title>
			<p>To analyze the research data repositories of Colombian universities, the four steps defined in <xref ref-type="bibr" rid="B11">[11]</xref> were implemented as follows:</p>
			<p>
				<table-wrap id="t1">
					<label>Table 1</label>
					<caption>
						<title>Selected universities.</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead>
							<tr>
								<th align="left">Rank <italic>(Global)</italic></th>
								<th align="left">Institution</th>
								<th align="left">Type</th>
								<th align="left">Institutional Repository</th>
								<th align="left">Data Repository</th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="left">1(1,000)</td>
								<td align="left">National University of Colombia</td>
								<td align="left">Public</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
							<tr>
								<td align="left">2(1,922)</td>
								<td align="left">University of Antioquia</td>
								<td align="left">Public</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
							<tr>
								<td align="left">3(2,467)</td>
								<td align="left">Pontifical Javeriana University</td>
								<td align="left">Private</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
							<tr>
								<td align="left">4(2,472)</td>
								<td align="left">University of the Andes</td>
								<td align="left">Private</td>
								<td align="left">X (DSpace)</td>
								<td align="left">Dataverse</td>
							</tr>
							<tr>
								<td align="left">5(3,667)</td>
								<td align="left">Universidad del Rosario</td>
								<td align="left">Private</td>
								<td align="left">X (DSpace)</td>
								<td align="left">Dataverse</td>
							</tr>
							<tr>
								<td align="left">6(3,820)</td>
								<td align="left">University of the Coast</td>
								<td align="left">Public</td>
								<td align="left">X (DSpace)</td>
								<td align="left">Dataverse</td>
							</tr>
							<tr>
								<td align="left">7(4,776)</td>
								<td align="left">University of Valle</td>
								<td align="left">Public</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
							<tr>
								<td align="left">8(4,834)</td>
								<td align="left">Pontifical Bolivarian University</td>
								<td align="left">Private</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
							<tr>
								<td align="left">9(4,903)</td>
								<td align="left">University of La Sabana</td>
								<td align="left">Private</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
							<tr>
								<td align="left">10(5,120)</td>
								<td align="left">Industrial University of Santander</td>
								<td align="left">Public</td>
								<td align="left">X (DSpace)</td>
								<td align="left">N/A</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
			<p>
				<list list-type="alpha-lower">
					<list-item>
						<p>Definition of analysis criteria: The criterion defined for the analysis was that the data be published in the data repository or the institutional repository. Shared datasets were reviewed and the FAIR criteria were evaluated. As an additional step, R3Data was used to check whether each repository is indexed in this registry.</p>
					</list-item>
					<list-item>
						<p>Selection of the sample population: The SCImago Journal Rank (SJR) is a public access portal that includes journals and country scientific indicators developed from information contained in the Scopus® database (Elsevier B.V.). The indicators provided can be used to evaluate and analyze scientific domains. The SJR is a classification of academic and research-related institutions based on a composite indicator that combines three different sets of indicators covering research performance, innovation results, and social impact measured through web visibility <xref ref-type="bibr" rid="B12">[12]</xref>. For this reason, the top 10 Colombian universities in the general SCImago ranking were selected to validate the existence of a data repository and assess the FAIR principles for each. The type of institution (private or public), the thematic areas in which datasets are published, the existence of a data repository, and the technologies used were also identified. <xref ref-type="table" rid="t1">Table 1</xref> shows the universities selected for this study.</p>
					</list-item>
					<list-item>
						<p>Data collection: For this stage, the official website of each university was accessed and the institutional repository was located first. Within this repository, it was checked whether there is an attached data repository or whether datasets are accessible from it. Next, each of the links on the main page leading to the institutional repository was validated. When no such link was found on the page, the link to the institutional library was used to access the repository from there.</p>
					</list-item>
					<list-item>
						<p>Analysis of data repositories: The following activities were carried out for this analysis:</p>
					</list-item>
				</list>
			</p>
			<p>
				<list list-type="order">
					<list-item>
						<p>Accessing the university's website and searching for the research data repository.</p>
					</list-item>
					<list-item>
						<p>Verifying the technology implemented in the research data repository.</p>
					</list-item>
					<list-item>
						<p>Searching within the repository for published datasets.</p>
					</list-item>
					<list-item>
						<p>Evaluating FAIR principles for repositories that have published datasets.</p>
					</list-item>
				</list>
			</p>
			<p><xref ref-type="table" rid="t2">Table 2</xref> describes the evaluation criteria used to assess the FAIR principles, which help determine the current state of data repository implementation at the Colombian universities in the sample. During this analysis, it was found that few of the selected universities had datasets in a data repository. An additional group of institutions, identified through snowball sampling, a technique recognized in scientific research, was therefore included to expand the sample for validating the FAIR principles. Consequently, stages 3 and 4 of this methodology (data collection and analysis of data repositories) were also carried out for this new group of universities.</p>
			<p>
				<table-wrap id="t2">
					<label>Table 2</label>
					<caption>
						<title>Evaluation Criteria. Adapted from Langer et al. <xref ref-type="bibr" rid="B13">[13]</xref>
						</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead>
							<tr>
								<th align="center">FAIR Subgroup</th>
								<th align="center">Criterion</th>
								<th align="center">Description</th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="justify" rowspan="3"> FINDABLE </td>
								<td align="justify">F1</td>
								<td align="justify">Is a particular research data set findable in the current platform?</td>
							</tr>
							<tr>
								<td align="justify">F2</td>
								<td align="justify">Is the research data information in that platform indexed in data catalogs, registries, and search engines?</td>
							</tr>
							<tr>
								<td align="justify">F3</td>
								<td align="justify">Is a search interface available with filter possibilities for structured Linked Data?</td>
							</tr>
							<tr>
								<td align="justify" rowspan="4"> ACCESSIBLE </td>
								<td align="justify">A1</td>
								<td align="justify">Can new research data be stored or referenced easily?</td>
							</tr>
							<tr>
								<td align="justify">A2</td>
								<td align="justify">Is the user input interface Linked Data-aware and easy to use by hiding technical terms and identifiers? </td>
							</tr>
							<tr>
								<td align="justify">A3</td>
								<td align="justify">Can the research data or metadata be accessed directly via http(s)?</td>
							</tr>
							<tr>
								<td align="justify">A4</td>
								<td align="justify">Do the authentication and authorization settings for public/private/restricted access exist?</td>
							</tr>
							<tr>
								<td align="justify" rowspan="4"> INTEROPERABLE </td>
								<td align="justify">I1</td>
								<td align="justify">Is the metadata description available in an RDF serialization?</td>
							</tr>
							<tr>
								<td align="justify">I2</td>
								<td align="justify">Can particular established ontologies be used to describe the research data set in a general way, such as <italic>schema.org/Dataset, DataCite, DCAT/DublinCore</italic>?</td>
							</tr>
							<tr>
								<td align="justify">I3</td>
								<td align="justify">Can domain-specific vocabularies be used to further describe the research data set?</td>
							</tr>
							<tr>
								<td align="justify">I4</td>
								<td align="justify">Can each concept related to the research data set be described with a corresponding URI?</td>
							</tr>
							<tr>
								<td align="justify" rowspan="4"> REUSABLE </td>
								<td align="justify">R1</td>
								<td align="justify">Can a data license be specified in a Linked Data fashion?</td>
							</tr>
							<tr>
								<td align="justify">R2</td>
								<td align="justify">Can the data provenance be specified and updated in a structured way?</td>
							</tr>
							<tr>
								<td align="justify">R3</td>
								<td align="justify">Are data sets in a relationship based on Linked Data and criteria such as the topic, community, used method, or similar? </td>
							</tr>
							<tr>
								<td align="justify">R4</td>
								<td align="justify">Is the provided data validated or do compliance checks exist?</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
			<p>The scoring system used to evaluate the FAIR principles against these criteria is presented in <xref ref-type="table" rid="t3">Table 3</xref>, which was adapted from <xref ref-type="bibr" rid="B13">[13]</xref>. A Value column was added to allow a comparative analysis between the data repositories of the universities in the sample.</p>
			<p>
				<table-wrap id="t3">
					<label>Table 3</label>
					<caption>
						<title>Scoring System. Adapted from Langer et al. <xref ref-type="bibr" rid="B13">[13]</xref>
						</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<tbody>
							<tr>
								<td align="justify"><bold>Indicator</bold></td>
								<td align="justify"><bold>Description</bold></td>
								<td align="justify"><bold>Value</bold></td>
							</tr>
							<tr>
								<td align="justify">+</td>
								<td align="justify">If the criterion was entirely fulfilled.</td>
								<td align="justify">1</td>
							</tr>
							<tr>
								<td align="justify">o</td>
								<td align="justify">If the criterion was partially fulfilled.</td>
								<td align="justify">0.5</td>
							</tr>
							<tr>
								<td align="justify">-</td>
								<td align="justify">If the criterion was not fulfilled.</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">()</td>
								<td align="justify">If the feature is limited in the native version but might be there with plugins.</td>
								<td align="justify">ND</td>
							</tr>
							<tr>
								<td align="justify"><sub>?</sub></td>
								<td align="justify">If it was not possible to assess the mentioned criterion.</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">%</td>
								<td align="justify">If the criterion was not applicable.</td>
								<td align="justify">NE</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
		</sec>
		<sec sec-type="results">
			<title>4. RESULTS</title>
			<p>From the official page of each university in <xref ref-type="table" rid="t1">Table 1</xref>, the institutional repository and the data repository were located in order to analyze the FAIR principles that can be adopted according to the tool used, as was done in <xref ref-type="bibr" rid="B10">[10]</xref>. When reviewing the institutional and data repositories of each university, the number of indexed degree theses, articles, datasets, and institutional documents was identified. <xref ref-type="table" rid="t4">Table 4</xref> shows the counts of the products found in the university repositories analyzed. It is worth noting that universities were added as indicated in the methodology section, and HEIs without published datasets are highlighted.</p>
			<p>
				<table-wrap id="t4">
					<label>Table 4</label>
					<caption>
						<title>Products found in the repositories. <italic>(PY)</italic></title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
							<col/>
						</colgroup>
						<thead>
							<tr>
								<th align="center">Rank</th>
								<th align="center">Institution (Website)</th>
								<th align="center">TS</th>
								<th align="center">PR</th>
								<th align="center">DS</th>
								<th align="center">ID</th>
								<th align="center">R3</th>
								<th align="center">DG</th>
								<th align="center">PY</th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="justify">1</td>
								<td align="justify">National University of Colombia (<ext-link ext-link-type="uri" xlink:href="https://repositorio.unal.edu.co/">https://repositorio.unal.edu.co</ext-link> )</td>
								<td align="justify">25,202</td>
								<td align="justify">36,236</td>
								<td align="justify">0</td>
								<td align="justify">78</td>
								<td align="justify">1</td>
								<td align="justify">6</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">2</td>
								<td align="justify">University of Antioquia (<ext-link ext-link-type="uri" xlink:href="https://bibliotecadigital.udea.edu.co/">https://bibliotecadigital.udea.edu.co</ext-link> )</td>
								<td align="justify">18,244</td>
								<td align="justify">701</td>
								<td align="justify">0</td>
								<td align="justify">1,176</td>
								<td align="justify">0</td>
								<td align="justify">4</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">3</td>
								<td align="justify">Pontifical Javeriana University (<ext-link ext-link-type="uri" xlink:href="https://repository.javeriana.edu.co/">https://repository.javeriana.edu.co</ext-link> )</td>
								<td align="justify">7,527</td>
								<td align="justify">11,994</td>
								<td align="justify">0</td>
								<td align="justify">157</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">4</td>
								<td align="justify">University of the Andes (<ext-link ext-link-type="uri" xlink:href="https://repositorio.uniandes.edu.co/">https://repositorio.uniandes.edu.co</ext-link> )</td>
								<td align="justify">32,350</td>
								<td align="justify">1,181</td>
								<td align="justify">66</td>
								<td align="justify">104</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">66</td>
							</tr>
							<tr>
								<td align="justify">5</td>
								<td align="justify">Universidad del Rosario (<ext-link ext-link-type="uri" xlink:href="https://repository.urosario.edu.co/">https://repository.urosario.edu.co</ext-link> )</td>
								<td align="justify">1,273</td>
								<td align="justify">11,621</td>
								<td align="justify">63</td>
								<td align="justify">133</td>
								<td align="justify">1</td>
								<td align="justify">0</td>
								<td align="justify">63</td>
							</tr>
							<tr>
								<td align="justify">6</td>
								<td align="justify">University of the Coast (<ext-link ext-link-type="uri" xlink:href="https://repositorio.cuc.edu.co/">https://repositorio.cuc.edu.co</ext-link> )</td>
								<td align="justify">2,814</td>
								<td align="justify">6,670</td>
								<td align="justify">8</td>
								<td align="justify">968</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">8</td>
							</tr>
							<tr>
								<td align="justify">7</td>
								<td align="justify">University of Valle (<ext-link ext-link-type="uri" xlink:href="https://bibliotecadigital.univalle.edu.co/home">https://bibliotecadigital.univalle.edu.co/home</ext-link> )</td>
								<td align="justify">11,754</td>
								<td align="justify">7,306</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">5</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">8</td>
								<td align="justify">Pontifical Bolivarian University (<ext-link ext-link-type="uri" xlink:href="https://repository.upb.edu.co/">https://repository.upb.edu.co</ext-link> )</td>
								<td align="justify">1,513</td>
								<td align="justify">2,527</td>
								<td align="justify">0</td>
								<td align="justify">16</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">9</td>
								<td align="justify">University of La Sabana (<ext-link ext-link-type="uri" xlink:href="https://intellectum.unisabana.edu.co/">https://intellectum.unisabana.edu.co</ext-link> )</td>
								<td align="justify">8,064</td>
								<td align="justify">2,769</td>
								<td align="justify">0</td>
								<td align="justify">2,098</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">10</td>
								<td align="justify">Industrial University of Santander (<ext-link ext-link-type="uri" xlink:href="https://noesis.uis.edu.co/">https://noesis.uis.edu.co</ext-link> )</td>
								<td align="justify">4,059</td>
								<td align="justify">5,382</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">5</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">11</td>
								<td align="justify">University Foundation of the Andean Region (<ext-link ext-link-type="uri" xlink:href="https://digitk.are-andina.edu.co/home">https://digitk.are-andina.edu.co/home</ext-link> )</td>
								<td align="justify">2,425</td>
								<td align="justify">343</td>
								<td align="justify">2</td>
								<td align="justify">54</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">2</td>
							</tr>
							<tr>
								<td align="justify">12</td>
								<td align="justify">Luis Amigó Catholic University (<ext-link ext-link-type="uri" xlink:href="http://repository.ucatolicaluis-amigo.edu.co/">http://repository.ucatolicaluis-amigo.edu.co</ext-link> )</td>
								<td align="justify">2,550</td>
								<td align="justify">2,486</td>
								<td align="justify">5</td>
								<td align="justify">2,005</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">8</td>
							</tr>
							<tr>
								<td align="justify">13</td>
								<td align="justify">Cooperative University of Colombia (<ext-link ext-link-type="uri" xlink:href="https://repository.ucc.edu.co/">https://repository.ucc.edu.co</ext-link> )</td>
								<td align="justify">23,447</td>
								<td align="justify">624</td>
								<td align="justify">187</td>
								<td align="justify">248</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">140</td>
							</tr>
							<tr>
								<td align="justify">14</td>
								<td align="justify">University of Medellín (<ext-link ext-link-type="uri" xlink:href="https://repository.udem.edu.co/">https://repository.udem.edu.co</ext-link> )</td>
								<td align="justify">4,404</td>
								<td align="justify">1,385</td>
								<td align="justify">18</td>
								<td align="justify">489</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">10</td>
							</tr>
							<tr>
								<td align="justify">15</td>
								<td align="justify">EAN University (<ext-link ext-link-type="uri" xlink:href="https://repository.universidadean.edu.co/">https://repository.universidadean.edu.co</ext-link> )</td>
								<td align="justify">4,221</td>
								<td align="justify">9</td>
								<td align="justify">1</td>
								<td align="justify">1,036</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">1</td>
							</tr>
							<tr>
								<td align="justify">16</td>
								<td align="justify">EIA University (<ext-link ext-link-type="uri" xlink:href="https://repository.eia.edu.co/">https://repository.eia.edu.co</ext-link> )</td>
								<td align="justify">3,347</td>
								<td align="justify">24</td>
								<td align="justify">2</td>
								<td align="justify">628</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">1</td>
							</tr>
							<tr>
								<td align="justify">17</td>
								<td align="justify">Santo Tomás University (<ext-link ext-link-type="uri" xlink:href="https://repository.usta.edu.co/">https://repository.usta.edu.co</ext-link> )</td>
								<td align="justify">25,892</td>
								<td align="justify">7,434</td>
								<td align="justify">17</td>
								<td align="justify">57</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">10</td>
							</tr>
							<tr>
								<td align="justify">18</td>
								<td align="justify">Externado University (<ext-link ext-link-type="uri" xlink:href="https://bdigital.uexternado.edu.co/">https://bdigital.uexternado.edu.co</ext-link> )</td>
								<td align="justify">5,476</td>
								<td align="justify">13,875</td>
								<td align="justify">2</td>
								<td align="justify">40</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">2</td>
							</tr>
							<tr>
								<td align="justify">19</td>
								<td align="justify">University of Caldas (<ext-link ext-link-type="uri" xlink:href="https://repositorio.ucaldas.edu.co/">https://repositorio.ucaldas.edu.co</ext-link> )</td>
								<td align="justify">1,186</td>
								<td align="justify">6,468</td>
								<td align="justify">0</td>
								<td align="justify">33</td>
								<td align="justify">0</td>
								<td align="justify">2</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">20</td>
								<td align="justify">El Bosque University (<ext-link ext-link-type="uri" xlink:href="https://repositorio.unbosque.edu.co/">https://repositorio.unbosque.edu.co</ext-link> )</td>
								<td align="justify">7,691</td>
								<td align="justify">2,280</td>
								<td align="justify">0</td>
								<td align="justify">28</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
							</tr>
							<tr>
								<td align="justify">21</td>
								<td align="justify">Higher College of Antioquia (<ext-link ext-link-type="uri" xlink:href="https://colmayor.janium.net/">https://colmayor.janium.net</ext-link> )</td>
								<td align="justify">617</td>
								<td align="justify">177</td>
								<td align="justify">0</td>
								<td align="justify">70</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
								<td align="justify">0</td>
							</tr>
						</tbody>
					</table>
					<table-wrap-foot>
						<fn id="TFN1">
							<p><italic>Thesis (TS), Paper (PR), DataSet (DS), Institutional Documents (ID), R3DATA (R3), DATAGOV (DG), Papyrus</italic></p>
						</fn>
					</table-wrap-foot>
				</table-wrap>
			</p>
			<p>Subsequently, the institutional and data repositories were checked for indexing in open data platforms such as R3data, Datos.gov, and Papyrus. The aim was to determine how many datasets were shared on these platforms, which follow the national government's initiatives on the publication of open data. <xref ref-type="table" rid="t4">Table 4</xref> shows the number of datasets that the universities in the sample have made available on these platforms. The review found that few universities currently share datasets, and that those that do are simultaneously building their own data repositories, which can be accessed by the rest of the research community and contribute to the development of science.</p>
			<p>Only a few HEIs in the initial sample had research data repositories; those without one, as well as those without any dataset, were removed from the list. It was therefore necessary to include other universities in this study to obtain a larger group that would allow the FAIR principles to be evaluated in the data repositories of higher education institutions in Colombia. To this end, universities from the Papyrus repository were added using snowball sampling. Table V presents the added universities that have a repository indexed in Papyrus. For the evaluation of the FAIR principles, only universities with at least one published dataset were considered. The results matrix in <xref ref-type="table" rid="t5">Table 5</xref> was created to record the evaluation of each of the assessed criteria.</p>
			<p>
				<table-wrap id="t5">
					<label>Table 5</label>
					<caption>
						<title>Evaluation of FAIR criteria in repositories</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col span="3"/>
							<col span="4"/>
							<col span="4"/>
							<col span="4"/>
						</colgroup>
						<thead>
							<tr>
								<th align="justify">Institution</th>
								<th align="justify">DS</th>
								<th align="center" colspan="3">F </th>
								<th align="center" colspan="4">A </th>
								<th align="center" colspan="4">I </th>
								<th align="center" colspan="4">R </th>
							</tr>
							<tr>
								<th align="justify"> </th>
								<th align="justify"> </th>
								<th align="justify">1</th>
								<th align="justify">2</th>
								<th align="justify">3</th>
								<th align="justify">1</th>
								<th align="justify">2</th>
								<th align="justify">3</th>
								<th align="justify">4</th>
								<th align="justify">1</th>
								<th align="justify">2</th>
								<th align="justify">3</th>
								<th align="justify">4</th>
								<th align="justify">1</th>
								<th align="justify">2</th>
								<th align="justify">3</th>
								<th align="justify">4</th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="justify">University of the Andes</td>
								<td align="justify">66</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">University of the Coast</td>
								<td align="justify">8</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">Universidad del Rosario</td>
								<td align="justify">63</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">University Foundation of the Andean Region</td>
								<td align="justify">2</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">Luis Amigó Catholic University</td>
								<td align="justify">8</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">Cooperative University of Colombia</td>
								<td align="justify">140</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">University of Medellín</td>
								<td align="justify">10</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">EAN University</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">EIA University</td>
								<td align="justify">2</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">Santo Tomás University</td>
								<td align="justify">10</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
							<tr>
								<td align="justify">Externado University</td>
								<td align="justify">2</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">0.5</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">1</td>
								<td align="justify">NA</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
			<p>Each of the FAIR principles was assessed for the universities' repositories. The results indicate, for each question, the extent to which the datasets in these repositories are Findable (F), Accessible (A), Interoperable (I), and Reusable (R), following the scoring system described in the methodology.</p>
			<p>A total of 11 universities were evaluated, including the 4 universities selected from the SCImago ranking and the universities added through snowball sampling. Universities that did not have datasets available in Papyrus at the time the data were collected, such as Javeriana University, El Bosque University, University of Caldas, and Higher College of Antioquia, were excluded.</p>
		</sec>
		<sec sec-type="discussion">
			<title>5. DISCUSSION</title>
			<p>The results show that the implementation of data repositories in Colombian universities is still at an initial stage. All the analyzed universities have an institutional repository that provides access to relevant information for researchers. In contrast, data repositories have been implemented by only 3 of the 11 universities initially evaluated, which suggests that, despite national government directives, universities are not yet aligned with open data and with the sharing of research data promoted by the Open Science movement. Additionally, on some of these institutions' websites it was difficult to find the link to the institutional repository. In most cases (80%), one must go through the library page or use a search engine to reach the institutional or data repository.</p>
			<p>
				<xref ref-type="fig" rid="f1">Figure 1</xref> shows that only two universities, Universidad del Rosario and the National University of Colombia, have their data repositories indexed in R3data, which offers the published information to the entire research community and thereby gives it greater visibility. Likewise, only 4 universities from the sample have published datasets on the DATOS.GOV website, and these mainly correspond to selection processes and mandatory financial reports for public entities. It is worth highlighting that none of the analyzed private universities publish datasets on this portal.</p>
			<p>Concerning the technology implemented for institutional repositories, it was found that the universities in the initial sample, selected based on the SCImago ranking, use DSpace, primarily because of its ease of implementation <xref ref-type="bibr" rid="B13">[13]</xref>. It is therefore important to highlight which FAIR principles this technology can support and what its main limitations are. According to <xref ref-type="bibr" rid="B13">[13]</xref>, DSpace fully complies with 7 and partially with 4 of the 15 evaluated criteria. Of the four remaining criteria, which are not met, three (I1, I3, and I4) belong to the INTEROPERABLE principle and refer to the availability of metadata descriptions in RDF, the use of specific vocabularies, and the description of each concept related to the dataset, respectively.</p>
			<p>
				<fig id="f1">
					<label>Figure 1</label>
					<caption>
						<title><italic>Open data initiatives.</italic></title>
					</caption>
					<graphic xlink:href="0121-1129-rfing-33-70-e18340-gf1.png"/>
				</fig>
			</p>
			<p>Lastly, R4, which belongs to the REUSABLE principle, evaluates whether the provided data are validated. Dataverse, in turn, fully complies with 9 of the 15 evaluated criteria and partially with 4. Only two criteria are not met, I3 and R4, which coincide with two of the criteria that DSpace also fails to meet: the use of specific vocabularies and data validation. Dataverse is the technology implemented in the Papyrus metadata repository, from which the snowball sample was taken.</p>
			<p>Regarding the review of the items shared in the repositories (theses, articles, datasets, and institutional documents), it was found that the largest volume corresponds to theses (60%) and research articles (37%), which together account for 97% of the total found. Datasets, on the other hand, make up only 0.11%. The Cooperative University of Colombia stands out with 140 datasets in its data repository, followed by the University of the Andes with 66 datasets and the Universidad del Rosario with 63 datasets.</p>
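			<p>The shares reported above can be reproduced from the column totals of <xref ref-type="table" rid="t4">Table 4</xref>. The following is a minimal Python sketch and not part of the original study: the totals were obtained by adding the 21 rows of the table, and the names <monospace>totals</monospace> and <monospace>share</monospace> are illustrative assumptions.</p>
			<preformat>
```python
# Column totals obtained by summing the 21 rows of Table 4.
# The variable and function names are illustrative, not taken from the study.
totals = {
    "Theses (TS)": 194_056,
    "Papers (PR)": 121_492,
    "Datasets (DS)": 371,
    "Institutional documents (ID)": 9_418,
}

# Total number of items across the four document types.
grand_total = sum(totals.values())


def share(count, total):
    """Return count as a percentage of total."""
    return 100 * count / total


# Print the share of each document type with two decimals.
for label, count in totals.items():
    print(f"{label}: {share(count, grand_total):.2f}%")
```
			</preformat>
			<p>Rounded to whole percentages, theses and papers come out at about 60% and 37%, and datasets at about 0.11%, in line with the figures given in the text.</p>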
			<p>Concerning the thematic areas of the datasets, 32% correspond to Social Sciences, 20% to Medicine, Health, and Life Sciences, and 14% to Business and Administration. Areas such as Engineering and Computing and Information Sciences contribute only 9% and 7%, respectively, even though many of their processes require data collection and processing, which would suggest a higher number of datasets in these areas. For example, the Cooperative University of Colombia contributes a high number of datasets in Medicine, Health, and Life Sciences, with 37 datasets published in 2023 compared to 2 datasets published in the current year, indicating a decrease in dataset publication.</p>
			<p>Regarding the FAIR principles, <xref ref-type="fig" rid="f2">Figure 2</xref> shows that, since all datasets are available in Papyrus, they meet most of the criteria to the same degree. The variations correspond to the use of more detailed metadata and the existence of access restrictions on the datasets. Concerning the FINDABLE principle, all universities have a score of 2.5, corresponding to two criteria fully met and one partially met. This indicates that the data have a DOI, are indexed in data catalogs, and allow for data filtering, effectively supporting findability: once published, data and metadata can be found by the community through search tools, and the metadata include descriptive information about the context or characteristics of the data.</p>
			<p>Regarding the ACCESSIBLE principle, which aims to ensure that data and metadata are accessible and can be downloaded by other researchers using their identifiers, 54% of the universities in the sample obtained a score of 3.5. The remaining 46% are universities that restrict access to these data within the repository and therefore scored 4 points: for them, criterion A4, which checks whether authentication parameters exist for data usage, was fully met and scored 1 point. Data access can be carried out via a standard, open, free, and universally implementable communication protocol and, where necessary, a system for data authentication and authorization is available. Access to metadata remains even when the data are no longer available.</p>
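			<p>The distribution of ACCESSIBLE scores can be checked directly against <xref ref-type="table" rid="t5">Table 5</xref>. The sketch below is illustrative only and not part of the original study: it assumes that the score of a principle is the sum of its criterion scores (1 for fully met, 0.5 for partially met), and the names <monospace>accessible</monospace>, <monospace>scores</monospace>, and <monospace>distribution</monospace> are hypothetical.</p>
			<preformat>
```python
from collections import Counter

# A1 to A4 scores transcribed from Table 5 (1 = fully met, 0.5 = partially met).
# The dictionary layout and all names are illustrative assumptions.
accessible = {
    "University of the Andes": [1, 1, 1, 0.5],
    "University of the Coast": [1, 1, 1, 1],
    "Universidad del Rosario": [1, 1, 1, 0.5],
    "University Foundation of the Andean Region": [1, 1, 1, 1],
    "Luis Amigó Catholic University": [1, 1, 1, 0.5],
    "Cooperative University of Colombia": [1, 1, 1, 1],
    "University of Medellín": [1, 1, 1, 1],
    "EAN University": [1, 1, 1, 1],
    "EIA University": [1, 1, 1, 0.5],
    "Santo Tomás University": [1, 1, 1, 0.5],
    "Externado University": [1, 1, 1, 0.5],
}

# Assumed scoring rule: the principle score is the sum of its criterion scores.
scores = {name: float(sum(values)) for name, values in accessible.items()}

# Count how many universities obtained each total.
distribution = Counter(scores.values())

for total, count in sorted(distribution.items()):
    pct = 100 * count / len(scores)
    print(f"A = {total}: {count} of {len(scores)} universities ({pct:.1f}%)")
```
			</preformat>
			<p>Under this assumption, 6 of the 11 universities obtain 3.5 and 5 obtain 4, which corresponds to the approximate 54% and 46% split described above.</p>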
			<p>For the INTEROPERABLE principle, all the universities in the sample scored 3, indicating that three criteria were fully met and one criterion, I3, could not be evaluated. This criterion corresponds to the use of domain-specific vocabularies. This does not mean that the goal of the principle, namely that both data and metadata are described according to community rules using open standards to allow their exchange and reuse, is not covered. It only suggests that new functionalities should be implemented to enable the use of metadata standards such as schema.org/Dataset, DataCite, DCAT, and Dublin Core, which focus on the interoperability of datasets and allow their use in various tasks related to the thematic area. This gives researchers the possibility of reusing and reproducing experiments conducted by others, thus enhancing accessibility, enabling the sharing of information obtained from studies, and fostering active collaboration in innovation, solution creation, and the reinforcement of research results with the same focus.</p>
			<p>
				<fig id="f2">
					<label>Figure 2</label>
					<caption>
						<title><italic>FAIR principles evaluation</italic></title>
					</caption>
					<graphic xlink:href="0121-1129-rfing-33-70-e18340-gf2.png"/>
				</fig>
			</p>
			<p>Finally, the REUSABLE principle, which helps ensure that data and metadata can be reused by other researchers, scored 2.5 for all the universities in the sample. This indicates that two criteria, R1 and R2, are fully met, while R3 is partially met, and R4 could not be evaluated because it corresponds to data compliance controls that are not implemented in the technology. The reusability of data is one of the most important focuses of the open science movement, where linked data has a significant impact because datasets can be linked through their associated metadata. This allows researchers to find other datasets that can be used, thereby adding value to new research. Due to the limitations of the technology implemented for data repositories, criteria 10 (I3) and 15 (R4) were not evaluated in this review, which is why they appear as NA (Not Applicable) in <xref ref-type="table" rid="t5">Table 5</xref>.</p>
			<p>In conclusion, it can be affirmed that the implementation of the FAIR principles has been gradually developed in the repositories of higher education institutions, laying the foundation for a more detailed implementation that supports the guidelines established to achieve the main goals of open science, namely data reuse and online accessibility.</p>
		</sec>
		<sec sec-type="conclusions">
			<title>6. CONCLUSIONS</title>
			<p>From the review conducted, it was found that the universities in the sample have institutional repositories. However, few of them have datasets available in these repositories, while others have implemented research data repositories that are indexed in meta-repositories such as Papyrus. This demonstrates the progress of higher education institutions in Colombia in terms of sharing and publishing data in open science.</p>
			<p>The results of the evaluation of the FAIR principles show that there are differences between the technologies implemented for data repositories and the definitions of the FAIR principles, particularly regarding the inclusion of Linked Data, domain-specific vocabularies, and, from another perspective, the controls for both access and validation of the provided data.</p>
			<p>Papyrus is an initiative in which many universities have partnered to pursue the common goal of publishing research datasets. For universities that do not have a data repository, joining this initiative could provide an implementation platform that consolidates institutional efforts in this area.</p>
			<p>As future work, a review of the FAIR criteria that are partially implemented, not implemented, or not yet evaluated is proposed, especially those related to Linked Data and specific vocabularies, in order to improve the data repositories of Colombian universities.</p>
		</sec>
	</body>
	<back>
		<ack>
			<title>ACKNOWLEDGMENTS</title>
			<p>We sincerely express our gratitude to the University of Cauca, especially to the Artificial Intelligence Research Group <bold>(GICO)</bold>, for their support in developing this work.</p>
		</ack>
		<ref-list>
			<title>REFERENCES</title>
			<ref id="B1">
				<label>[1]</label>
				<mixed-citation>M.-A. Osorio-Sanabria, F. O. Amaya Fernández, M. González-Zabala, &quot;Análisis de datos abiertos de instituciones de educación superior colombianas como apoyo a la relación Universidad-Entorno,&quot; <italic>Entramado</italic>, vol. 16, no. 1, pp. 272-284, Dec. 2020. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.18041/1900-3803/entramado.1.6127">https://doi.org/10.18041/1900-3803/entramado.1.6127</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Osorio-Sanabria</surname>
							<given-names>M.-A.</given-names>
						</name>
						<name>
							<surname>Amaya Fernández</surname>
							<given-names>F. O.</given-names>
						</name>
						<name>
							<surname>González-Zabala</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<article-title>Análisis de datos abiertos de instituciones de educación superior colombianas como apoyo a la relación Universidad-Entorno</article-title>
					<source>Entramado</source>
					<volume>16</volume>
					<issue>1</issue>
					<fpage>272</fpage>
					<lpage>284</lpage>
					<month>12</month>
					<year>2020</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.18041/1900-3803/entramado.1.6127">https://doi.org/10.18041/1900-3803/entramado.1.6127</ext-link>
				</element-citation>
			</ref>
			<ref id="B2">
				<label>[2]</label>
				<mixed-citation>P. Hartmann, J. Henkel, &quot;The rise of corporate science in AI: Data as a strategic resource,&quot; <italic>Academy of Management Discoveries</italic>, vol. 6, no. 3, pp. 359-381, Oct. 2020. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5465/amd.2019.0043">https://doi.org/10.5465/amd.2019.0043</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Hartmann</surname>
							<given-names>P.</given-names>
						</name>
						<name>
							<surname>Henkel</surname>
							<given-names>J.</given-names>
						</name>
					</person-group>
					<article-title>The rise of corporate science in AI: Data as a strategic resource</article-title>
					<source>Academy of Management Discoveries</source>
					<volume>6</volume>
					<issue>3</issue>
					<fpage>359</fpage>
					<lpage>381</lpage>
					<month>10</month>
					<year>2020</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5465/amd.2019.0043">https://doi.org/10.5465/amd.2019.0043</ext-link>
				</element-citation>
			</ref>
			<ref id="B3">
				<label>[3]</label>
				<mixed-citation>S. M. Roa-Martínez, S. A. Vidotti, R. C. Santana, &quot;Estructura propuesta del artículo de datos como publicación científica,&quot; <italic>Revista Española de Documentación Científica</italic>, vol. 40, no. 1, e167, Mar. 2017. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3989/redc.2017.1.1375">https://doi.org/10.3989/redc.2017.1.1375</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Roa-Martínez</surname>
							<given-names>S. M.</given-names>
						</name>
						<name>
							<surname>Vidotti</surname>
							<given-names>S. A.</given-names>
						</name>
						<name>
							<surname>Santana</surname>
							<given-names>R. C.</given-names>
						</name>
					</person-group>
					<article-title>Estructura propuesta del artículo de datos como publicación científica</article-title>
					<source>Revista Española de Documentación Científica</source>
					<volume>40</volume>
					<issue>1</issue>
					<elocation-id>e167</elocation-id>
					<month>03</month>
					<year>2017</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3989/redc.2017.1.1375">https://doi.org/10.3989/redc.2017.1.1375</ext-link>
				</element-citation>
			</ref>
			<ref id="B4">
				<label>[4]</label>
				<mixed-citation>S. Kowalczyk, K. Shankar, &quot;Data sharing in the sciences,&quot; <italic>Annual Review of Information Science and Technology</italic>, vol. 45, pp. 247-294, 2011. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/aris.2011.1440450113">https://doi.org/10.1002/aris.2011.1440450113</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Kowalczyk</surname>
							<given-names>S.</given-names>
						</name>
						<name>
							<surname>Shankar</surname>
							<given-names>K.</given-names>
						</name>
					</person-group>
					<article-title>Data sharing in the sciences</article-title>
					<source>Annual Review of Information Science and Technology</source>
					<volume>45</volume>
					<fpage>247</fpage>
					<lpage>294</lpage>
					<year>2011</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/aris.2011.1440450113">https://doi.org/10.1002/aris.2011.1440450113</ext-link>
				</element-citation>
			</ref>
			<ref id="B5">
				<label>[5]</label>
				<mixed-citation>M. Salvador, C. Ramió, &quot;Capacidades analíticas y gobernanza de datos en la Administración pública como paso previo a la introducción de la Inteligencia Artificial,&quot; <italic>Revista del CLAD Reforma y Democracia</italic>, no. 77, pp. 5-36, 2020.</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Salvador</surname>
							<given-names>M.</given-names>
						</name>
						<name>
							<surname>Ramió</surname>
							<given-names>C.</given-names>
						</name>
					</person-group>
					<article-title>Capacidades analíticas y gobernanza de datos en la Administración pública como paso previo a la introducción de la Inteligencia Artificial</article-title>
					<source>Revista del CLAD Reforma y Democracia</source>
					<issue>77</issue>
					<fpage>5</fpage>
					<lpage>36</lpage>
					<year>2020</year>
				</element-citation>
			</ref>
			<ref id="B6">
				<label>[6]</label>
				<mixed-citation>S. M. Angelozzi, &quot;La gestión de datos de investigación en abierto: introducción al rol emergente para las bibliotecas universitarias y científicas argentinas,&quot; <italic>Palabra Clave (La Plata)</italic>, vol. 9, no. 2, e091, Apr. 2020. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.24215/18539912E091">https://doi.org/10.24215/18539912E091</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Angelozzi</surname>
							<given-names>S. M.</given-names>
						</name>
					</person-group>
					<article-title>La gestión de datos de investigación en abierto: introducción al rol emergente para las bibliotecas universitarias y científicas argentinas</article-title>
					<source>Palabra Clave (La Plata)</source>
					<volume>9</volume>
					<issue>2</issue>
					<elocation-id>e091</elocation-id>
					<month>04</month>
					<year>2020</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.24215/18539912E091">https://doi.org/10.24215/18539912E091</ext-link>
				</element-citation>
			</ref>
			<ref id="B7">
				<label>[7]</label>
				<mixed-citation>R. Aleixandre-Benavent, R. Lucas-Domínguez, A. Sixto-Costoya, A. Vidal-Infer, &quot;The sharing of research data in the Cell &amp; Tissue Engineering area: Is it a common practice?,&quot; <italic>Stem Cells and Development</italic>, vol. 27, no. 11, pp. 717-722, Jun. 2018. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1089/SCD.2018.0036">https://doi.org/10.1089/SCD.2018.0036</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Aleixandre-Benavent</surname>
							<given-names>R.</given-names>
						</name>
						<name>
							<surname>Lucas-Domínguez</surname>
							<given-names>R.</given-names>
						</name>
						<name>
							<surname>Sixto-Costoya</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>Vidal-Infer</surname>
							<given-names>A.</given-names>
						</name>
					</person-group>
					<article-title>The sharing of research data in the Cell &amp; Tissue Engineering area: Is it a common practice?</article-title>
					<source>Stem Cells and Development</source>
					<volume>27</volume>
					<issue>11</issue>
					<fpage>717</fpage>
					<lpage>722</lpage>
					<month>06</month>
					<year>2018</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1089/SCD.2018.0036">https://doi.org/10.1089/SCD.2018.0036</ext-link>
				</element-citation>
			</ref>
			<ref id="B8">
				<label>[8]</label>
				<mixed-citation>MinTIC, <italic>MinTIC expide el Plan Nacional de Infraestructura de Datos, que impulsará la transformación digital del Estado</italic>, 2024. <ext-link ext-link-type="uri" xlink:href="https://goo.su/QEUWUy">https://goo.su/QEUWUy</ext-link>
				</mixed-citation>
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<collab>MinTIC</collab>
					</person-group>
					<source>MinTIC expide el Plan Nacional de Infraestructura de Datos, que impulsará la transformación digital del Estado</source>
					<year>2024</year>
					<ext-link ext-link-type="uri" xlink:href="https://goo.su/QEUWUy">https://goo.su/QEUWUy</ext-link>
				</element-citation>
			</ref>
			<ref id="B9">
				<label>[9]</label>
				<mixed-citation>M. D. Wilkinson <italic>et al.</italic>, &quot;The FAIR Guiding Principles for scientific data management and stewardship,&quot; <italic>Scientific Data</italic>, vol. 3, no. 1, pp. 1-9, Mar. 2016. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/sdata.2016.18">https://doi.org/10.1038/sdata.2016.18</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Wilkinson</surname>
							<given-names>M. D.</given-names>
						</name>
						<etal/>
					</person-group>
					<article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>
					<source>Scientific Data</source>
					<volume>3</volume>
					<issue>1</issue>
					<fpage>1</fpage>
					<lpage>9</lpage>
					<month>03</month>
					<year>2016</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/sdata.2016.18">https://doi.org/10.1038/sdata.2016.18</ext-link>
				</element-citation>
			</ref>
			<ref id="B10">
				<label>[10]</label>
				<mixed-citation>F. Méndez, A. Baptista, R. López, A. Vázquez, &quot;Implementación de los repositorios de datos de investigación en las universidades públicas españolas: estado de la cuestión,&quot; <italic>Scire</italic>, vol. 29, no. 2, pp. 39-49, Nov. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.54886/scire.v29i2.4914">https://doi.org/10.54886/scire.v29i2.4914</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Méndez</surname>
							<given-names>F.</given-names>
						</name>
						<name>
							<surname>Baptista</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>López</surname>
							<given-names>R.</given-names>
						</name>
						<name>
							<surname>Vázquez</surname>
							<given-names>A.</given-names>
						</name>
					</person-group>
					<article-title>Implementación de los repositorios de datos de investigación en las universidades públicas españolas: estado de la cuestión</article-title>
					<source>Scire</source>
					<volume>29</volume>
					<issue>2</issue>
					<fpage>39</fpage>
					<lpage>49</lpage>
					<month>11</month>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.54886/scire.v29i2.4914">https://doi.org/10.54886/scire.v29i2.4914</ext-link>
				</element-citation>
			</ref>
			<ref id="B11">
				<label>[11]</label>
				<mixed-citation>M. I. S. Oliveira, H. R. de Oliveira, L. A. Oliveira, B. F. Lóscio, &quot;Open Government Data Portals Analysis,&quot; in <italic>Proceedings of the 17th International Digital Government Research Conference on Digital Government Research</italic>, New York, NY, USA: ACM, 2016. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/2912160.2912163">https://doi.org/10.1145/2912160.2912163</ext-link>
				</mixed-citation>
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Oliveira</surname>
							<given-names>M. I. S.</given-names>
						</name>
						<name>
							<surname>Oliveira</surname>
							<given-names>H. R. de</given-names>
						</name>
						<name>
							<surname>Oliveira</surname>
							<given-names>L. A.</given-names>
						</name>
						<name>
							<surname>Lóscio</surname>
							<given-names>B. F.</given-names>
						</name>
					</person-group>
					<chapter-title>Open Government Data Portals Analysis</chapter-title>
					<source>Proceedings of the 17th International Digital Government Research Conference on Digital Government Research</source>
					<publisher-loc>New York, NY, USA</publisher-loc>
					<publisher-name>ACM</publisher-name>
					<year>2016</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/2912160.2912163">https://doi.org/10.1145/2912160.2912163</ext-link>
				</element-citation>
			</ref>
			<ref id="B12">
				<label>[12]</label>
				<mixed-citation>SCImago, <italic>SJR - SCImago Journal &amp; Country Rank</italic>, 2024. <ext-link ext-link-type="uri" xlink:href="https://www.scimagojr.com/aboutus.php">https://www.scimagojr.com/aboutus.php</ext-link>
				</mixed-citation>
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<collab>SCImago</collab>
					</person-group>
					<source>SJR - SCImago Journal &amp; Country Rank</source>
					<year>2024</year>
					<ext-link ext-link-type="uri" xlink:href="https://www.scimagojr.com/aboutus.php">https://www.scimagojr.com/aboutus.php</ext-link>
				</element-citation>
			</ref>
			<ref id="B13">
				<label>[13]</label>
				<mixed-citation>A. Langer, E. Bilz, M. Gaedke, <italic>Analysis of current RDM applications for the interdisciplinary publication of research data</italic>, 2019. <ext-link ext-link-type="uri" xlink:href="https://ceur-ws.org/Vol-2447/paper1.pdf">https://ceur-ws.org/Vol-2447/paper1.pdf</ext-link>
				</mixed-citation>
				<element-citation publication-type="book">
					<person-group person-group-type="author">
						<name>
							<surname>Langer</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>Bilz</surname>
							<given-names>E.</given-names>
						</name>
						<name>
							<surname>Gaedke</surname>
							<given-names>M.</given-names>
						</name>
					</person-group>
					<source>Analysis of current RDM applications for the interdisciplinary publication of research data</source>
					<year>2019</year>
					<ext-link ext-link-type="uri" xlink:href="https://ceur-ws.org/Vol-2447/paper1.pdf">https://ceur-ws.org/Vol-2447/paper1.pdf</ext-link>
				</element-citation>
			</ref>
		</ref-list>
		<fn-group>
			<fn fn-type="other" id="fn0">
				<label>How to cite:</label>
				<p>G. A. Lopez-Hoyos, S. M. Roa-Martínez, &quot;Evaluation of FAIR Principles in Research Data Repositories at Colombian Universities,&quot; <italic>Revista Facultad de Ingeniería</italic>, vol. 33, no. 70, e18340, 2024. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.19053/01211129.v33.n70.2024.18340">https://doi.org/10.19053/01211129.v33.n70.2024.18340</ext-link>
				</p>
			</fn>
			<fn fn-type="other" id="fn1">
				<label>Gineth Andrea Lopez-Hoyos:</label>
				<p>Conceptualization; Methodology; Investigation; Formal Analysis; Writing-review and editing.</p>
			</fn>
			<fn fn-type="other" id="fn2">
				<label>Sandra Milena Roa-Martínez:</label>
				<p>Conceptualization; Supervision; Writing-review and editing.</p>
			</fn>
		</fn-group>
	</back>
</article>