Faculty Spotlights

The Molecules of Life

 

The Molecules of Life

 

Basic biology textbooks will tell you: all life on Earth is built from four types of molecules. The four molecules of life are proteins, carbohydrates, lipids, and nucleic acids, with each of the four groups vital for every single living organism.

Until a decade ago, conducting research on the evolution and interactions of large collections of atoms and molecules could only be modeled using very simple computer simulation experiments, because the computing power needed to handle the large data-sets to understand life at its most basic form just wasn’t available.

Experiment harnessed the full power of the HiPerGator supercomputer over the 2023 Winter Break.

Basic biology textbooks will tell you: all life on Earth is built from four types of molecules. The four molecules of life are proteins, carbohydrates, lipids, and nucleic acids, with each of the four groups vital for every single living organism.

Until a decade ago, conducting research on the evolution and interactions of large collections of atoms and molecules could only be modeled using very simple computer simulation experiments, because the computing power needed to handle the large data-sets to understand life at its most basic form just wasn’t available.

During UF’s 2023 Winter Break, Jinze Xue, a Ph.D. student in the Roitberg Computational Chemistry Group, conducted a large-scale early Earth chemistry experiment utilizing more than 1000 A100 GPUs on HiPerGator. A molecular dynamics experiment was performed on 22 million atoms that identified 12 amino acids, three nucleobases, one fatty acid, and two dipeptides. The discovery of larger molecules, which would not have been possible in smaller computing systems, is a significant achievement.

Dr. Adrian Roitberg, V.T. and Louise Jackson Professor in UF’s Department of Chemistry, and his research group have been exploring the use of Machine Learning (ML) to study chemical reactions for the past six years.

“Our previous success enabled us to use ML/AI to calculate energies and forces on molecular systems, with results that are identical to those of high-level quantum chemistry--but around one million times faster!

The accomplishments empowered us to ask the following question: Can one create ‘molecules of life’, such as amino acids and DNA bases starting from a simple mix of gases and simulating their reactions?

Roitberg continues:

“These questions have been asked before, but due to computational limitations, previous calculations used small numbers of atoms and could not explore the range of time needed to obtain results.  But with HiPerGator, we can do it!”

Erik Deumens, senior director in UFIT for Research Computing, explains how this full takeover of HiPerGator was possible:

"HiPerGator is heavily used throughout the year for research and education. But it has the unique capability to run very large `hero’ calculations that use the entire machine with the potential to lead to breakthroughs in science and scholarship. Such calculations require careful scheduling to not impact the normal business of supporting research and teaching, which is why we do them around quieter times, like Christmas and over the July 4th holidays.  When we found out about the work Dr. Roitberg’s group was doing, we approached him to try a `hero’ run with the code he developed.”

The emergence of AI and very powerful GPUs can enable such highly compute and data intensive scientific simulations to be carried out--calculations scientists could only imagine just a few years ago.  The combination of AI models that speed up the dynamics of very large collections of atoms and molecules coupled with the power of the University of Florida’s supercomputer (the fastest in U.S. higher education) enabled the experiment to be done.

Roitberg continues:

“Using Machine Learning methods, we created a simulation using the complete HiPerGator set of GPUs. We were able to see, in real time, the formations of almost every amino acid (e.g., alanine, glycine, etc.) and a number of very complex molecules.  This was very exciting to experience.”

This project is part of the effort to discover how complex molecules can form from basic building blocks, and to make the process automatic through large computer simulations. The molecules of life is one example of this general idea.

Roitberg notes that he and his research group spent many hours working with members of UFIT. Ms. Ying Zhang, UFIT’s AI Support Manager, ran point for this experiment.

“Ying put together a team comprised of Research Computing staff and staff from NVIDIA to help scale compute runs, provide invaluable advice and help, and accelerate analysis of the data to the point where the analyses were done in just seven hours, instead of the three days we initially expected it to take.  We met every week from initial conception to the final results, in a very fruitful collaboration.”

The results, and the short time in which HiPerGator was able to deliver them is inspiring new lines of inquiry from Roitberg and others who are getting closer every day to answering questions about how complex molecules, such as those seen in life, is formed.

“This is a great opportunity for UF faculty,” says Roitberg.  “Having HiPerGator in-house, with the incredible staff willing to go above and beyond to help researchers produce ground-breaking science like this, is something that makes my non-UF colleagues very jealous.”

HiPerGator, the University of Florida supercomputer, went online in 2013 with 16,000 cores. It has been expanded and upgraded every few years since its initial go-live. HiPerGator’s last big expansion with the latest technologies was in 2020 and put into production in 2021. HiPerGator is housed in the UF Data Center, a 25,000+ sq. ft, Leadership in Energy and Environmental Design (LEED®) certified building.  Learn more about HiPerGator:  https://rc.ufl.edu/get-started/hipergator/.

 

 

 

Yonghui Wu in front of HiperGator AI

Medical AI tool from UF and NVIDIA gets human thumbs-up in first study

A new artificial intelligence computer program created by researchers at UF and NVIDIA can generate doctors’ notes so well that two physicians couldn’t tell the difference, according to an early study from both groups. A new artificial intelligence computer program created by researchers at UF and NVIDIA can generate doctors’ notes so well that two physicians couldn’t tell the difference, according to an early study from both groups.

 

Dr. Mildred Maldonado-Molina

Enabling research @ UF Family Data Center

Billions of rows of data that is subject to specific protections under federal and state laws…UF’s Institute for Child Health Policy & the Family Data Center leverages this incredible data volume, often referred to as “Big Data”, to investigate ways to improve healthcare delivery in Florida. But first, staff must ensure the records they are entrusted with are safe from exposure.

Dr. Mildred Maldonado-Molina is leading the efforts at the Family Data Center. She worked closely with UFIT to develop a solution that works for research while also compliant with all information security and privacy laws. A scientist, she and her team did not have the time or expertise to build the necessary computing infrastructure. Roland Estrella, manager of clinical research for the Department of Health Outcomes and Biomedical Informatics, summarizes what was built:

“Working with UFIT, the Family Data Center developed a robust computing environment. We have on-demand access to data from multiple state agencies and the capacity to process billions of medical and administrative records under strict federal information security guidelines.”

Dr. Stephen Coombes

Identifying damage in the brain’s superhighway

UF researchers have developed a template showing the brain’s superhighways and how they are impacted by a stroke. The brain images required to create the template were processed on HiPerGator.

“We’re interested in the structure of the brain after a stroke,” said Stephen Coombes, assistant professor of Applied Physiology and Kinesiology. “Collecting and analyzing images of brains from people that haven’t experienced a stroke helps us track the different motor pathways in the brain; sort of a ‘Google Maps’ for the brain’s corticospinal tract.”

The benefits of mapping the corticospinal tract — it’s a superhighway for movement — can have a significant impact to the care and recovery of stroke patients.

“Knowing which part of the tract is damaged may be extremely helpful in predicting recovery after stroke,” notes Coombes. “Physical therapists can also use this information to prescribe more individualized rehabilitation exercises.”

Utilizing 3,000 HiPerGator cores, Coombes said Dr. Derek Archer’s imaging needs were completed in three months. Without HiPerGator’s processing power, analyzing the data on a single computer would have taken an astounding 42 years of processing time. For more information about Coombes’s and Archer’s work on the corticospinal tract template and its applied use capabilities, visit the Laboratory for Rehabilitation Neuroscience web page.

 

DR. MELANIE CORRELL

Bringing genetic information into crop models

 


 

AARON BEVERIDGE

Using HiPerGator to work with social media big data

 


 

DR. FORREST MASTERS

Reducing loss of life and property during extreme wind events

 


 

DR. JULIE JOHNSON

Understanding sources of variability in how people respond to medications

 


 

DR. DAMON WOODARD

Using biological signals to establish individual identity