septiembre 6, 2022

Post-Doctoral Research Visit F/M Software Heritage: large-scale empirical analysis of open source code artifacts

This postdoc position is open at Inria in the context of the Software Heritage project, on the topic of
large-scale empirical analysis of open source code artifacts, such as source code files and commits, as captured by state-of-the art distributed version control systems (VCSs).

Inria is a national research institute dedicated to digital sciences that promotes scientific excellence and transfer. Inria employs 2,400 collaborators organised in research project teams, usually in collaboration with its academic partners. This agility allows its scientists, from the best universities in the world, to meet the challenges of computer science and mathematics, either through multidisciplinarity or with industrial partners.

Software Heritage is a unique initative to build the universal archive of software source code, catering for the needs of research, industry and society as a whole.


With the help of the Software Heritage team, the recruited person will work on the analysis of the Software Heritage graph dataset (see the article online at, the largest publicly available corpus of Free and Open Source Software (FOSS) development history.

Main activities

To this end, the recruited person will:

  • Exploit the compressed representation of the Software Heritage graph (see the article online at to conduct empirical analyses on subsets of interest of the Software Heritage archive.
  • Propose, implement, and validate experimentally novel ways of exploiting the Software Heritage archive to conduct similar analyses in the future.
  • Produce and curate open datasets mixing and matching software artifacts from Software Heritage and related data sources.

The research activity will take place in a multidisciplinary team involving computer scientists and industry leaders with the purpose of analyzing free/open source software at the scale of Software Heritage. It will also involve documentation of the work and results in the form of conference proceedings and journal papers, and the presentation of the results at scientific meetings.

Benefits package

  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training
  • Social security coverage

The keys to success

The ideal candidate will have obtained a PhD degree within the last 3 years, in computer science, applied math, computational science, or a related technical field, and will have proven experience in:

  • conducting experiments in the field of empirical software engineering
  • mining software artifacts from free/open source software repositories

It is expected that the candidate will have proven consensus builder in a highly collaborative environment and excellent written and oral communication skills.

Will be considered a plus:

  • experience with graph compression/analysis techniques
  • experience with machine learning and «big code» analysis
  • participation in Free/Open Sources Software development

Workplace and salary

The job must be worked on-site at Inria’s Paris headquarters.

The salary will be commensurate with experience and qualifications.

Application instructions

Applications must be submitted via email to The email must contain the subject line «post-doctoral research visit». A complete application should include:

  • resume,
  • brief statement of interest in the position,
  • links to any previous related work online.

Applications will be reviewed on a rolling basis until the position is filled.

Software Heritage is an equal opportunity employer and will not discriminate against any employee or application for employment on the basis of race, color, marital status, religion, age, sex, sexual orientation, national origin, handicap, or any other legally protected status. We value diversity in our workplace.

septiembre 6, 2022