University of Washington will lead study arm of national AI initiative

Ophthalmologists at the University of Washington School of Medicine will have a major role in the National Institutes of Health’s initiative to realize AI’s promise to improve medical care more fully.

Past research has examined the ability of AI to discern hints of diabetic retinopathy from retinal scans such as the one above. Image credit: Wikimedia Commons

The NIH Common Fund identified the universities and institutes whose work over the next four years will set the foundation for its “Bridge to Artificial Intelligence” (Bridge2AI) program. The program’s goal is to generate flagship datasets about human health that scientists across domains of medicine can use. As a result, AI would have an open-source, broadly supported pathway to greater potential and impact.  

“The program recognizes AI’s power to learn to recognize disease from great volumes of data, but the efforts so far have been fundamentally problematic,” said Dr. Aaron Lee, a UW associate professor of ophthalmology.

“The patient data has been narrowly focused on a few diseases, and it mainly has been retrospective — that is, collected from previous research and not adequately representative of the diverse populations we treat,” he said. “It has led to AI algorithms that are potentially unbalanced in terms of patients’ age, ethnicity, race and gender.”

He and Dr. Cecilia Lee, also an associate professor of ophthalmology, are co-principal investigators of the UW Medicine-led study arm. The NIH is funding the project with $7.8 million in its first year, the majority of which will be disbursed to participating research sites.

Using diabetes as its model disease, the consortium will aim to collect data from 4,000 participants.  The goal is to generate an AI-friendly, hypothesis-agnostic dataset, Aaron Lee said.

“We will collect and organize patient data to create a model that scientists and labs could easily extend and replicate with other disease models in the future,” he said.

The diabetes study will employ digital technologies to capture more information about how the disease is affected by patients’ activities and environments. For a two-week span, each participant will use:

  • a continuous glucose monitor, to give a minute-by-minute profile of blood-sugar levels;
  • a high-frequency fitness tracker (e.g, FitBit) to monitor physical activity, whose data can be aligned with that from the glucose monitor;
  • an environmental sensor to be placed in their bedroom, which will measure pollutants, ambient temperature and amount of light (lumens).

“These are types of data not traditionally collected in a medical dataset, and we hope it will give us a more holistic picture of participants when they are not at the hospital,” Lee said.

To ensure the data is population-representative, the 4,000 participants will be balanced for three factors: disease severity, race/ethnicity, and sex. In the study, 1,000 participants will have no diabetes, 1,000 will have diabetes that is lifestyle-controlled, 1,000 will have diabetes that is oral medication-controlled, and 1,000 will be insulin-dependent. Each of those subgroups will be balanced between male and female and across Black, white, Asian and Latino populations.

“This is trying to get at the heart of the shortfall of AI to date, by collecting well-curated, well-balanced datasets,” Lee said. “These are intended to unlock a host of AI applications without the worry about biases being introduced by the data-collection model.”

The Lees’ computational ophthalmology lab has examined deep-learning models’ value in medical practice. In 2021, investigators tested seven algorithms designed to detect diabetic eye disease from retinal scans, and found that just one met the performance of human screeners.

“These worked fairly well in the screening context. There didn’t seem to be any bias in detecting disease in people of different races, but there was a decrease in performance with respect to people’s age. That is what this new project hopes to address,” Lee said. “If you don't have a well-constructed, balanced dataset, then the AI models will fail in underrepresented groups.”

Source: University of Washington