Close

Publicaciones

Reconocer y hacer referencia al patrimonio del software

A continuación encontrará una lista de publicaciones científicas relevantes producidas como parte de nuestra misión.

Si su trabajo científico se benefició de Software Heritage, le animamos a que lo reconozca en sus publicaciones. La forma preferida de hacerlo es: (1) agregar una nota a pie de página en la portada de sus artículos como esta: “Este trabajo fue posible gracias a Software Heritage, la gran biblioteca de código fuente: https://www.softwareheritage. organización”; y (2) citar al menos uno de los artículos de iPres 2017 y CACM 2018 (de la lista a continuación) en la sección de Referencias de sus publicaciones científicas.

Política de publicación

Estamos comprometidos con el acceso abierto y nos esforzamos por hacer disponibles abiertamente todas las publicaciones financiadas por o para Software Heritage, si es posible bajo una licencia CC-BY-4.0. Cuando es necesario, hacemos una copia de la (pre)publicación disponible a través de enlaces en esta página.

Publicaciones

2025

Morane Gruenpeter, Roberto Di Cosmo, Sabrina Granger, Jean-François Abramatic, Laetitia Cruse, Nicole Martinelli

Software Heritage: Open Science Strategic Blueprint Informe técnico

2025.

BibTeX | Enlaces:

Alex Ioannidis, Morane Gruenpeter, Roberto Di Cosmo, Sabrina Granger, R Boyer, David Douard, Valentin Lorentz, Antoine Lambert, Alain Monteil, Jozefina Sadowska, Estelle Nivault, Eko Indarto, Wilko Steinhoff, Saadet Bozaci, Michael Wagner, Michael Didas, Maxence Azzouz-Thuderoz, Moritz Schubotz, Thanasis Vergoulis, Paolo Manghi, Serafeim Chatzopoulos, Themis Zamani, Kostas Koumantaros, Konstantinos Kagkelidis, Kyriakos Gkinis

Deliverable D6.2 – Report on Research Software Archival APIs and Connectors Informe técnico

2025.

BibTeX | Enlaces:

Adèle Desmazières, Roberto Di Cosmo, Valentin Lorentz

50 Years of Programming Language Evolution through the Software Heritage looking glass Proceedings Article

En: IEEE, (Ed.): Mining Software Repositories, Ottawa (Canada), Canada, 2025.

BibTeX | Enlaces:

Olivier Barais, Roberto Di Cosmo, Ludovic Mé, Stefano Zacchiroli, Olivier Zendra

Software Identification for Cybersecurity: Survey and Recommendations for Regulators Sin publicar

2025, (working paper or preprint).

BibTeX | Enlaces:

Roberto Di Cosmo, Sabrina Granger, Konrad Hinsen, Nicolas Jullien, Daniel Le Berre, Violaine Louvet, Camille Maumet, Clémentine Maurice, Raphaël Monat, Nicolas P. Rougier

CODE beyond FAIR Sin publicar

2025, (working paper or preprint).

BibTeX | Enlaces:

2024

Morane Gruenpeter, Sabrina Granger, Alain Monteil, Jozefina Sadowska, Estelle Nivault, Alex Ioannidis, Maxence Azzouz-Thuderoz, Michael Wagner, Saadet Bozaci, Sarala Wimalaratne, Sven Bingert, Göksenin Cakir

Deliverable D6.1 | Report on Standardisation and Curation of Software Metadata and PIDs Informe técnico

2024.

BibTeX | Enlaces:

Ludovic Courtès, Timothy Sample, Simon Tournier, Stefano Zacchiroli

Source Code Archiving to the Rescue of Reproducible Deployment Proceedings Article

En: 2024 ACM Conference on Reproducibility and Replicability, pp. 10 pages, ACM, 2024.

Resumen | BibTeX | Enlaces:

Tommaso Fontana, Sebastiano Vigna, Stefano Zacchiroli

WebGraph: The Next Generation (Is in Rust) Proceedings Article

En: Companion Proceedings of the ACM Web Conference 2024 (WWW '24 Companion), pp. 686-689, ACM, 2024.

Resumen | BibTeX | Enlaces:

Annalí Casanueva, Davide Rossi, Stefano Zacchiroli, Théo Zimmermann

The Impact of the COVID-19 Pandemic on Women’s Contribution to Public Code Artículo de revista

En: Empirical Software Engineering, 2024.

BibTeX | Enlaces:

2023

Mathilde Fichen, Morane Gruenpeter , Jérémy Bobbio , Sabrina Granger , Roberto Di Cosmo, Jean-François Abramatic, Isabelle Astic , Emmanuelle Bermès, Camille Françoise, Claude Gomez, Wendy Hagenmaier, Grégory Miura, Carlo Montangero, Simon Phipps, Kenneth Seals-Nutt

SWHAP Workshop, September 14th and 15th, 2023 Actas de congresos

HAL, 2023.

Resumen | BibTeX | Enlaces:

Valentin Lorentz, Di Cosmo, Roberto, Stefano Zacchiroli

The Popular Content Filenames Dataset: Deriving Most Likely Filenames from the Software Heritage Archive Sin publicar

2023, (working paper or preprint).

Resumen | BibTeX | Enlaces:

Romain Lefeuvre, Jessie Galasso, Benoit Combemale, Houari Sahraoui, Stefano Zacchiroli

Fingerprinting and Building Large Reproducible Datasets Proceedings Article

En: 2023 ACM Conference on Reproducibility and Replicability, pp. 27-36, ACM, 2023.

Resumen | BibTeX | Enlaces:

Jesus M. Gonzalez-Barahona, Sergio Montes-Leon, Gregorio Robles, Stefano Zacchiroli

The Software Heritage License Dataset (2022 Edition) Artículo de revista

En: Empirical Software Engineering, 2023, ISSN: 1382-3256.

Resumen | BibTeX | Enlaces:

Roberto Di Cosmo, Stefano Zacchiroli

The Software Heritage Open Science Ecosystem Capítulo de libro

En: Mens, Tom; Roover, Coen De; Cleve, Anthony (Ed.): Software Ecosystems: Tooling and Analytics, pp. 33–61, Springer International Publishing, Cham, 2023, ISBN: 978-3-031-36060-2.

Resumen | BibTeX | Enlaces:

2022

Roberto Di Cosmo

Code Source Book Section

En: Dictionnaire du Numérique, vol. February, 2022.

BibTeX | Enlaces:

Davide Rossi, Stefano Zacchiroli

Worldwide Gender Differences in Public Code Contributions (and How They Have Been Affected by the COVID-19 Pandemic) Proceedings Article

En: 44th International Conference on Software Engineering (ICSE 2022) – Software Engineering in Society (SEIS) Track, pp. 172-183, ACM, 2022.

Resumen | BibTeX | Enlaces:

Stefano Zacchiroli

A Large-scale Dataset of (Open Source) License Text Variants Proceedings Article

En: The 2022 Mining Software Repositories Conference (MSR 2022), pp. 757-761, ACM, 2022.

Resumen | BibTeX | Enlaces:

Davide Rossi, Stefano Zacchiroli

Geographic Diversity in Public Code Contributions: An Exploratory Large-Scale Study Over 50 Years Proceedings Article

En: The 2022 Mining Software Repositories Conference (MSR 2022), pp. 80-85, ACM, 2022.

Resumen | BibTeX | Enlaces:

Daniele Serafini, Stefano Zacchiroli

Efficient Prior Publication Identification for Open Source Code Proceedings Article

En: 18th International Conference on Open Source Systems (OSS 2022), ACM, 2022.

Resumen | BibTeX | Enlaces:

Kevin Wellenzohn, Michael H. Böhlen, Sven Helmer, Antoine Pietri, Stefano Zacchiroli

Robust and Scalable Content-and-Structure Indexing Artículo de revista

En: the VLDB Journal, 2022, ISSN: 1066-8888.

Resumen | BibTeX | Enlaces:

Roberto Di Cosmo

Should We Preserve the World's Software History, And Can We? Proceedings Article

En: Silvello, Gianmaria; Corcho, Óscar; Manghi, Paolo; Nunzio, Giorgio Maria Di; Golub, Koraljka; Ferro, Nicola; Poggi, Antonella (Ed.): Linking Theory and Practice of Digital Libraries – 26th International Conference on Theory and Practice of Digital Libraries, TPDL 2022, Padua, Italy, September 20-23, 2022, Proceedings, pp. 3–7, Springer, 2022.

BibTeX | Enlaces:

Roberto Di Cosmo

Building the software pillar of Open Science Proceedings Article

En: Open Science European Conferencem (OSEC 2022), pp. 183–193, OpenEdition Press, 2022, ISBN: 9791036545627.

BibTeX | Enlaces:

2021

Antoine Pietri

Organizing the graph of public software development for large-scale mining Tesis doctoral

Université Paris Cité, 2021.

BibTeX | Enlaces:

Morane Gruenpeter, Roberto Di Cosmo, Katherine Thornton, Kenneth Seals-Nutt, Carlo Montangero, Guido Scatena

Software Stories for landmark legacy code Informe técnico

Inria 2021.

BibTeX | Enlaces:

Laura Bussi, Roberto Di Cosmo, Carlo Montangero, Guido Scatena

Preserving landmark legacy software with the Software Heritage Acquisition Process Proceedings Article

En: iPres2021 – 17th International Conference on Digital Preservation, Beijing, China, 2021.

BibTeX | Enlaces:

Stefano Zacchiroli

Gender Differences in Public Code Contributions: a 50-year Perspective Artículo de revista

En: IEEE Software, 2021, ISSN: 0740-7459.

Resumen | BibTeX | Enlaces:

Thibault Allançon, Antoine Pietri, Stefano Zacchiroli

The Software Heritage Filesystem (SwhFS): Integrating Source Code Archival with Development Proceedings Article

En: ICSE 2021: The 43rd International Conference on Software Engineering, pp. 45-48, IEEE, 2021.

Resumen | BibTeX | Enlaces:

2020

Morane Gruenpeter, Roberto Di Cosmo, Alice Allen, Anita Bandrowski, Peter Chan, Martin Fenner, Leyla Garcia, Catherine M Jones, Daniel S Katz, John Kunze, Moritz Schubotz, Ilian T Todorov

Use cases and identifier schemes for persistent software source code identification Informe técnico

2020, (Output from the Research Data Alliance/FORCE11 Software Source Code Identification Working group).

BibTeX | Enlaces:

Morane Gruenpeter, Roberto Di Cosmo, Hylke Koers, Patricia Herterich, Rob Hooft, Jessica Parland-von Essen, Jonas Tana, Tero Aalto, Sarah Jones

M2.15 Assessment report on 'FAIRness of software' Miscelánea

2020.

BibTeX | Enlaces:

Roberto Di Cosmo

Archiving and Referencing Source Code with Software Heritage Proceedings Article

En: ICMS, pp. 362–373, Springer, 2020, ISBN: 978-3-030-52200-1.

Resumen | BibTeX | Enlaces:

Guillaume Rousseau, Roberto Di Cosmo, Stefano Zacchiroli

Software provenance tracking at the scale of public source code Artículo de revista

En: Empirical Software Engineering, pp. 1-30, 2020, ISSN: 1573-7616.

Resumen | BibTeX | Enlaces:

Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli

Determining the Intrinsic Structure of Public Software Development History Proceedings Article

En: MSR 2020: The 17th International Conference on Mining Software Repositories, pp. 602-605, IEEE, 2020.

Resumen | BibTeX | Enlaces:

Antoine Pietri, Guillaume Rousseau, Stefano Zacchiroli

Forking Without Clicking: on How to Identify Software Repository Forks Proceedings Article

En: MSR 2020: The 17th International Conference on Mining Software Repositories, pp. 277-287, IEEE, 2020.

Resumen | BibTeX | Enlaces:

Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli

The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History Proceedings Article

En: MSR 2020: The 17th International Conference on Mining Software Repositories, pp. 1-5, IEEE, 2020.

Resumen | BibTeX | Enlaces:

Roberto Di Cosmo, Marco Danelutto

[Rp] Reproducing and replicating the OCamlP3l experiment Artículo de revista

En: ReScience C, vol. 6, no 1, 2020.

Resumen | BibTeX | Enlaces:

Paolo Boldi, Antoine Pietri, Sebastiano Vigna, Stefano Zacchiroli

Ultra-Large-Scale Repository Analysis via Graph Compression Proceedings Article

En: SANER 2020: The 27th IEEE International Conference on Software Analysis, Evolution and Reengineering, pp. 184-194, IEEE, 2020.

Resumen | BibTeX | Enlaces:

Roberto Di Cosmo, Jose Benito Gonzalez Lopez, Jean-François Abramatic, Kay Graf, Miguel Colom, Paolo Manghi, Melissa Harrison, Yannick Barborini, Ville Tenhunen, Michael Wagner, Wolfgang Dalitz, Jason Maassen, Carlos Martinez-Ortiz, Elisabetta Ronchieri, Sam Yates, Moritz Schubotz, Leonardo Candela, Martin Fenner, Eric Jeangirard

Scholarly Infrastructures for Research Software Libro

European Commission. Directorate General for Research and Innovation., 2020, ISBN: 978-92-76-25568-0.

BibTeX | Enlaces:

Pierre Alliez, Roberto Di Cosmo, Benjamin Guedj, Alain Girault, Mohand-Said Hacid, Arnaud Legrand, Nicolas Rougier

Attributing and Referencing (Research) Software: Best Practices and Outlook From Inria Artículo de revista

En: Computing in Science Engineering, vol. 22, no 1, pp. 39-52, 2020, ISSN: 1558-366X.

Resumen | BibTeX | Enlaces:

Roberto Di Cosmo

Announcing biblatex-software Artículo de revista

En: ACM SIGSOFT Software Engineering Notes, vol. 45, no 4, pp. 22–23, 2020.

BibTeX | Enlaces:

Roberto Di Cosmo, Morane Gruenpeter, Bruno Marmol, Alain Monteil, Laurent Romary, Jozefina Sadowska

Curated Archiving of Research Software Artifacts: Lessons Learned from the French Open Archive (HAL) Artículo de revista

En: International Journal of Digital Curation, vol. 15, no 1, pp. 16, 2020.

Resumen | BibTeX | Enlaces:

Roberto Di Cosmo, Morane Gruenpeter, Stefano Zacchiroli

Referencing Source Code Artifacts: a Separate Concern in Software Citation Artículo de revista

En: Computing in Science & Engineering, 2020, ISSN: 1521-9615.

Resumen | BibTeX | Enlaces:

2019

Antoine Pietri, Diomidis Spinellis, Stefano Zacchiroli

The Software Heritage Graph Dataset: Public software development under one roof Proceedings Article

En: Proceedings of the 16th International Conference on Mining Software Repositories, pp. 138-142, IEEE Press, 2019.

Resumen | BibTeX | Enlaces:

Mélanie Clément-Fontaine, Roberto Di Cosmo, Bastien Guerry, Patrick Moreau, François Pellegrini

Encouraging a wider usage of software derived from research En línea

2019, (Position paper of the software working group of the French National Council for Open Science).

BibTeX | Enlaces:

2018

Antoine Pietri, Stefano Zacchiroli

Towards Universal Software Evolution Analysis Proceedings Article

En: BENEVOL 2018: The 17th Belgium-Netherlands Software Evolution Workshop, pp. 6-10, 2018, ISSN: 1613-0073.

Resumen | BibTeX | Enlaces:

Jean-François Abramatic, Roberto Di Cosmo, Stefano Zacchiroli

Building the Universal Archive of Source Code Artículo de revista

En: Communications of the ACM, vol. 61, no 10, pp. 29-31, 2018, ISSN: 0001-0782.

BibTeX | Enlaces:

Roberto Di Cosmo, Morane Gruenpeter, Stefano Zacchiroli

Identifiers for Digital Objects: the Case of Software Source Code Preservation Proceedings Article

En: iPRES 2018 – 15th International Conference on Digital Preservation, 2018.

BibTeX | Enlaces:

Yannick Barborini, Roberto Di Cosmo, Antoine R. Dumont, Morane Gruenpeter, Bruno P. Marmol, Alain Monteil, Jozefina Sadowska, Stefano Zacchiroli

The creation of a new type of scientific deposit: Software Miscelánea

RDA Eleventh Plenary Meeting, Berlin, Germany, 2018, (poster).

BibTeX | Enlaces:

Yannick Barborini, Roberto Di Cosmo, Antoine R. Dumont, Morane Gruenpeter, Bruno P. Marmol, Alain Monteil, Jozefina Sadowska, Stefano Zacchiroli

La création du nouveau type de dépôt scientifique – Le logiciel Miscelánea

JSO 2018 – 7es journées Science Ouverte Couperin : 100 % open access : initiatives pour une transition réussie, 2018, (poster).

BibTeX | Enlaces:

2017

Roberto Di Cosmo, Stefano Zacchiroli

Software Heritage: Why and How to Preserve Software Source Code Proceedings Article

En: iPRES 2017: 14th International Conference on Digital Preservation, Kyoto, Japan, 2017.

BibTeX | Enlaces: