The Federated European Genome-phenome Archive (FEGA) has expanded its network and continues to evolve by embracing emerging technologies
FEGA nodes. Credit: Karen Arnott/EMBL-EBI
Summary
The Federated European Genome-phenome Archive enables secure access, sharing, and reuse for sensitive genomic data.
FEGA’s network has expanded to include more partners. Poland, Portugal and most recently Canada have joined the network alongside the original nodes: Sweden, Norway, Finland, Germany, and Spain.
FEGA is set to grow further by integrating multi-omics and clinical data, exploring federated analytics, and leveraging emerging technologies.
The Federated European Genome-phenome Archive (FEGA) is transforming how sensitive human genomic data is shared and accessed. Building on the longstanding European Genome-phenome Archive (EGA) – a database for human genetic, phenotypic, and clinical data – FEGA enables researchers to securely discover and analyse human genomic data while ensuring compliance with national and international data protection regulations. By decentralising data storage across national nodes, FEGA ensures data remains secure within its country of origin while also being accessible to researchers worldwide.
To mark the news that Canada has now joined FEGA and to celebrate the recent publication of FEGA’s landmark paper in Nature Genetics, we take a look at FEGA’s achievements so far, the network’s future, and the collaborative effort driving the initiative.
From centralisation to federation
In the last two decades, as the volume of human genomic datasets grew larger and legal requirements to access the data were tightened, it became clear that a single repository like the EGA could no longer meet the global community’s needs. The EGA team proposed adopting a federated model, in line with local legal and ethical frameworks, while also allowing access to these data globally.
What is a federated model?
A federated model is a distributed network in which multiple entities operate under a common framework. Each node or entity retains control over its own data and processes, while agreed-upon standards and protocols allow them to collaborate and share data globally. This approach enables secure and coordinated data sharing across different countries, ensuring that local legal and ethical requirements are met.
EMBL-EBI and the Centre for Genomic Regulation (CRG) in Spain have co-managed the EGA since 2008. Together, they helped shape the federated data-sharing model now used by FEGA.
Participants at the FEGA launch symposium. Credit: Centre for Genomic Regulation
Two major ELIXIR-funded projects also helped make FEGA’s federated model possible. The ELIXIR-EXCELERATE initiative – a programme supporting international data sharing – helped lay the early groundwork with proof-of-concept studies. ELIXIR-CONVERGE – a project to standardise data management across Europe – has accelerated the development of the first national FEGA nodes.
“In the early days of setting up FEGA, we were grappling with the limitations of a centralised system. It felt like we were facing enormous challenges,” said Mallory Freeberg, Human Genomics Team Leader at EMBL-EBI. “Through collaboration, we overcame those obstacles to build the foundations of FEGA. I’m immensely proud to have been part of this endeavour. It’s an exciting time for this network, and there is so much more to achieve as we continue pushing the boundaries.”
By 2020, the foundation for FEGA’s federated model was set, leading to the official launch of the resource in September 2022. The first collaboration agreements were signed by nodes in Sweden, Norway, Finland, Germany, and Spain.
Federated cake pops to celebrate FEGA’s launch. Credit: Centre for Genomic Regulation
The first datasets were submitted to FEGA by late 2023, proving the federated model in practice. These data have since continued to grow, with FEGA now holding nearly 40 datasets. The network has also expanded with nodes in Portugal and Poland joining in 2023, and Canada in December 2024. Discussions are also ongoing with many other collaborators, including a Swiss node who are currently ‘Observers’ of FEGA.
Circulating small RNA sequencing data have been made available from the node in Norway; these can be used to identify biomarkers for endometrial cancer.
The first dataset shared from Poland is being used by researchers at the University of Vilnius to investigate the genetic basis of overgrowth syndrome, a rare genetic disorder that causes excessive growth in children.
Arcadi Navarro speaking at the FEGA launch symposium. Credit: Centre for Genomic Regulation
A federated future
Looking ahead, FEGA will also continue to refine its metadata standards, including those developed as part of the Global Alliance for Genomics and Health (GA4GH), to increase data discoverability and interoperability. This will support future integration of multi-omics datasets, as well as clinical health records for a more complete picture of disease biology.
Efforts to support more data submissions are also underway, including the development of training materials and the establishment of FEGA node-specific help desks to assist local users. Federated analytics are also in the works, which will enable secure, multi-node analyses without the need to download sensitive data.
As FEGA evolves, it sets a high standard for global human genomics data sharing. By integrating multi-omics and clinical data, FEGA will support research towards improved personalised medicine and help inform public health strategies to ultimately improve patient outcomes.