AIM-AHEAD Federated Network
About AIM-AHEAD
The AIM-AHEAD Coordinating Center was established to enhance diversity in the field of artificial intelligence and machine learning (AI/ML), with emphasis on reducing health disparities and promoting health equity. We hope to achieve these goals by engaging in a fair, equitable, and transparent process of building a consortium of AI/ML to promote health equity and an inclusive and diverse workforce. Many communities have untapped potential to contribute new expertise, data, recruitment strategies, and cutting-edge science to the AI/ML field. AIM-AHEAD Coordinating Center seeks to increase participation and engagement through mutually beneficial partnerships, stakeholder engagement, and outreach to advance health equity.
AIM-AHEAD Federated-Network Information
Due to institutional policies, sites with diverse data are often unable to share patient-level data outside of their institutions. Federated data networks allow for patient level-data to stay at the local site and the ability for sites to opt-in to participate; allowing for more diverse data and research while promoting data privacy and safety. This model also allows for greater inclusion and equity as sites that have not been able to engage in collaborations involving data sharing can participate in a federated model. Participating sites will receive assistance to set up local federated infrastructure and will be provided with centralized code to run on their data locally. Sites share ONLY aggregate data results as they feel comfortable. Aggregate data will be shared through use of a private GitHub repository so that they can be combined centrally and then shared with other network participants. The private Github repository will be password protected. ONLY AGGREGATE data will be shared, patient-level data will not be shared or stored outside of the local site. Easy-to-use dockerized environments can be set up at each participating site with data science tool kits to analyze electronic health record (EHR) data.
The AIM-AHEAD Federated Network allows sites to participate in collaborative research and share data across sites without the need for patient-level data to leave the local site. The program is open to institutions conducting research at the intersection of healthcare practice and delivery, AI/ML methods, and the advancement of health equity. The AIM-AHEAD Consortium seeks projects that represent diversity across fields of study, institutions, geography, and investigators.
This RFA calls for site participation in AIM-AHEAD federated research programs. The research team from each site is expected to (i) assemble EHR data locally from their institution to be used for AIM-AHEAD research projects; (ii) provide informatics expertise to participate in EHR data harmonization across sites; and (iii) facilitate the analysis of EHR data using AI/ML algorithms developed by the AIM-AHEAD researchers. While all patient-level EHR data remain behind the firewalls of the local site, the research team is expected to facilitate the sharing of aggregated data to support AIM-AHEAD research projects.
Expected outcomes for the participating site include protocols for data curation and data sharing fine tuned to their specific institution along with lessons learned. The sites are expected to work with the core team members to assess capacity in running AI/ML models locally and sharing outputs of AI/ML models. During year 2 of the funding cycle, the participating sites are expected to generate and share aggregated data for relevant research questions to enable the research teams to perform federated analyses.
The participating sites and the AIM-AHEAD Infrastructure Core are expected to jointly work on publications detailing both the process of data curation, data harmonization, AI/ML analyses with local data, as well as federated analyses for selected research questions.
Key Dates
The funding period will run for 24 months, starting September 3, 2024.
- Yr 1 Sept 3, 2024-Sept 2, 2025
- Yr 2 Sept 3, 2025-Sept 2, 2026
Specific Activities
Program Description
- AIM-AHEAD will select up to 5 organizations to participate in the Federated Data Network.
- Selected sites will have patient populations and investigator backgrounds and expertise that ensure a diverse network and research questions that align with the AIM-AHEAD North Stars.
- Site investigators, in collaboration with AIM-AHEAD leadership, will determine the research questions for the network and may participate in or lead the analyses.
- Research questions will be selected, in part, based on the types of data in the network, the resources available within the sites, and the feasibility of completing the research within the project period.
- IRB approval and associated agreements will be obtained to enable the request of EHR data for participating in research activities as part of the AIM-AHEAD federated network.
- No patient-level data will leave the participating sites. Analyses will be run locally, and only the aggregate results will be uploaded to a secure central location.
- A copy of the aggregate results will be posted to a private GitHub repository so that they can be combined centrally and then shared with other network participants.
- Central access to the individual sites’ aggregate results will be limited to investigators participating in the study. However, the combined data from all sites will be made publicly available. Publications based on the aggregate data will not present data subdivided by site unless all sites approve.
- Expected outcomes for the participating site include protocols for data curation and data sharing fine tuned to their specific institution along with lessons learned. The sites are expected to work with the core team members to assess capacity in running AI/ML models locally and sharing outputs of AI/ML models. During year 2 of the funding cycle, the participating sites are expected to generate and share aggregated data for relevant research questions to enable the research teams to perform federated analyses.
- The participating sites and the core are expected to jointly work on publications detailing both the process of data curation, data harmonization, AI/ML analyses with local data, as well as federated analyses for selected research questions.
- Each institution will compute the aggregate study data locally on their clinical data warehouse using SQL and R/python code that is provided to them by the AIM-AHEAD Infrastructure core.
Application Requirements
- Applicants must be able to extract EHR data, save the data to a secure location, run analyses on the data from software distributed to sites on Docker (e.g., R or Python), and share the aggregate results with others on the project team. Easy-to-use dockerized environments can be set up at each participating site with data science tool kits to analyze electronic health record (EHR) data
- Applications must provide counts of the number of patients whose data will be available to use in the network. The counts should be a table, similar in format to the NIH Inclusion Enrollment Report, showing breakdowns by sex, race, and ethnicity. (See https://www.era.nih.gov/erahelp/ASSIST/Content/ASSIST_Help_Topics/3_Form_Screens/PHS_HS_CT/Incl_Enroll_Rprt.htm)
- Applications must provide a brief description of their site’s patient population, including the primary geographic region of the patients and whether it contains adult and/or pediatric patients. Other characteristics may be highlighted as well, such as patients from community health centers, ambulatory or dental clinics, geriatric care, etc.
- Applications must provide a brief description of the available data, including the data types (demographics, diagnoses, medications, laboratory test results, etc.) and approximate start and end years of the data.
- Applications must provide a brief description, to the extent possible, of the local environment, including how the investigators will access the EHR data, the secure location where data extracts will be stored, and what computational resources are available for analyses.
- Applications must describe any prior experience working with EHR data, common data models or ontologies, or data from multi-site clinical studies.
Proposed Budget
Year 1: Site Engagement - governance $100K Total cost per application
Year 2: Site Engagement - implementation $300K Total cost per application
AIM-AHEAD Federated Network program will begin on September 3, 2024.
Year 1 of the AIM-AHEAD Federated Network Program will be focused on the collective establishment of the network’s governance policies, and Year 2 will be focused on developing the technical components to enable federated research.
Up to 5 sites will be awarded.
Read the Call for Proposal for Federated Network
Application Process
Step 1: Click here to register as a “mentee/learner” on AIM-AHEAD Connect (our Community Building Platform)
Step 2: Click here to submit an application for review using InfoReady platform
* To submit your application in InfoReady, please use Chrome, Firefox, or Edge. If you're using Safari, make sure to clear your cache before logging in.
Please note both steps must be completed for consideration.
All applications must be received by June 10th, 2024, 11:59 PM Eastern Time.
Information Webinar
There will be an informational webinar for the AIM-AHEAD Federated Research Network on Monday, May 13, 2024, from 1:00 pm - 2:00 pm ET.
Click here to watch the Federated Network Informational Webinar.
Link to view the Federated Network Informational Webinar Slide Deck.
Frequently Asked Questions
Collections of answers to our most Frequently Asked Questions will be available here:
AIM-AHEAD Federated Network Program FAQs
Please visit this page regularly as we will continue to add additional questions and answers.
Helpdesk
Questions regarding the AIM-AHEAD Federated Research Network may be directed to: https://helpdesk.aim-ahead.net/ticket/create/federated_network
Program Directors
Paul Avillach, MD, PhD, Griffin Weber, MD, PhD, Usha Sambamoorthi, PhD, Gabriel Brat, MD, MPH, Tianxi Cai, PhD