Apr 3 2004
Researchers at Rensselaer Polytechnic Institute today announced the release of a software program capable of quickly identifying molecules that show promise for future medicines. The software program enables drug makers to comb through enormous databases of potential molecules and identify the ones that have sound medicinal properties.
Rensselaer researchers with skills in computer science, chemistry, and math allied to create the software program. Chemistry Professor Curt Breneman, Mathematics Associate Professor Kristin Bennett, and Decision Sciences and Engineering Systems Associate Professor Mark Embrechts collaborated in the Drug Discovery and Semi-Supervised Learning project (DDASSL, pronounced “dazzle”), supported by a $1.2 million Knowledge and Distributed Intelligence Award from the National Science Foundation.
“The trick with drug discovery is to have the drug molecule fit like a key in a lock, because shape affects its performance,” Embrechts said. The safety and effectiveness of medicines depend on the shape and chemistry of the molecule. To find the most likely molecules, the new software makes use of two shortcuts in chemistry and math that enable the computer to search a vast molecular database rapidly.
The first shortcut describes the molecule, its shape and chemistry, in terms of numbers a computer can crunch rapidly. “Dr. Breneman has a technique to calculate electronic properties on the surface of a molecule very quickly,” Embrechts said. “It produces a description—basically a set of numbers—that the computer can use easily.”
Then, the second shortcut identifies which molecules have the right chemistry for a specific therapy. Using advanced pattern-recognition techniques known as kernel methods, the software analyzes a small sample database to identify molecules with the right chemical features. Once the key features are identified, the software can quickly screen large databases, accurately predicting the molecules that show potential.
“Conventional techniques are not truly predictive and don’t work,” Bennett said. “So we borrowed pattern recognition techniques already used in the pharmaceutical industry and added algorithms based on support vector machines. That gives us a technique to predict which molecules are promising.”
Rensselaer researchers noted that predictive modeling is one of a new breed of drug discovery methods that marks a shift in industry practice—a shift away from cell-based assays performed in the lab toward math-based models calculated on the computer.
“Our program allows researchers to ‘crash test’ lots of molecules quickly and inexpensively,” Breneman said. “That prevents a lot of false starts. The ultimate pay-off of this methodology may be that it can support the rapid invention of new drugs when diseases develop quickly and threaten society.”
As drug makers increasingly target complex, chronic illness, drug development becomes far more costly and time consuming. Meanwhile, in the search for new drugs, 99.9 percent of compounds tested ultimately fail. Accordingly, drug makers want to be able to predict more accurately which compounds will produce the next blockbuster drug.
The Rensselaer research team will continue work to improve drug discovery methods, which will be carried out in the new Rensselaer Center for Biotechnology and Interdisciplinary Studies, a state-of-the-art facility scheduled to open in September 2004.