Lukasz Kurgan, Ph.D.

Dean Robert J. Mattauch Chair | Vice Chair of Computer Science VCU College of Engineering

Engineering East Hall, Room E4268, Richmond VA

Data scientist specializing in high-throughput structural bioinformatics of proteins & small RNAs.

Biography

Lukasz Kurgan received his M.Sc. degree (with honors) in Automation and Robotics from AGH University of Science and Technology (Poland) in 1999 and a Ph.D. degree in Computer Science from the University of Colorado at Boulder in 2003. He joined the University of Alberta in 2003, where he received tenure in 2007 and was promoted to the rank of Professor in 2013. He moved to Virginia Commonwealth University in 2016 as the Robert J. Mattauch Endowed Professor of Computer Science.

Industry Expertise

Education/Learning

Research

Computer Software

Biotechnology

Pharmaceuticals

Areas of Expertise

Structural Bioinformatics

Intrinsically Disordered Proteins

Protein-ligand (drug) interactions

Computer-aided molecular modeling

Big Data Analysis

Drug Repurposing

Drug Repositioning

Structural Genomics

Accomplishments

Member of Faculty Opinions

2021-09-02

Inducted as member of the "Big Data & Analytics" section of the "Bioinformatics, Biomedical Informatics & Computational Biology" area.

Author of the winning flDPnn algorithm of the international Critical Assessment of Protein Intrinsic Disorder Prediction (CAID) challenge

2021-04-19

CAID is a worldwide competition that identifies the most accurate methods for predicting intrinsically disordered protein regions. The results were recently published in Nature Methods (https://www.nature.com/articles/s41592-021-01117-3), followed by a commentary article in the same journal that highlights our win (https://www.nature.com/articles/s41592-021-01123-5).

Fellow of the Kosciuszko Foundation Collegium of Eminent Scientists

2018-01-30

With citation for "outstanding achievements and contributions to the Polish scientific community."

Education

University of Colorado at Boulder

Ph.D.

Computer Science

2003

AGH University of Science and Technology (Poland)

M.Sc.

Automation and Robotics

1999

Affiliations

Professor, Department of Computer Science, Virginia Commonwealth University
Adjunct Professor, Department of Electrical and Computer Engineering, University of Alberta

Media Appearances

Computer science research team gains international recognition for method that accurately predicts intrinsic disorder in proteins

VCU news online

2021-05-19

A computer science research team from VCU Engineering won an international challenge for their novel method of predicting intrinsically disordered proteins. Kurgan's award-winning method now appears in the journal Nature Communications (https://www.nature.com/articles/s41467-021-24773-7). The editors of Nature Communications also placed Kurgan's article on the Editor's Highlights page, which features a small selection of articles the editorial team believes to be particularly interesting or important.

VCU professors join elite bioengineering institute

Commonwealth Times online

2018-04-16

Three professors were inducted into the American Institute for Medical and Biological Engineering (AIMBE) at a formal ceremony on April 9, 2018. Kurgan was nominated for his work in structural bioinformatics, using computer programs to study the structures of proteins and DNA.

VCU's Kurgan supercomputer programs help biologists to speed up hypothesis generation to understand proteins

Supercomputing Online News online

2017-07-24

“We have manually curated but understand less than 1 percent of these proteins, and right now there’s over 80 million to solve,” said Kurgan, a Qimonda-endowed professor and data scientist. “A program can solve these proteins faster than a single human and can help researchers speed up hypothesis generation.”

Research Grants

Integrated prediction of intrinsic disorder and disorder functions with modular multi-label deep learning

NSF

2021-08-31

Proteins are remarkable biological machines. Hundreds of millions of protein sequences were decoded over the last two decades creating a significant knowledge gap related to the fact that we do not know what most of them do. A common way to decipher protein functions relies on the sequence-to-structure-to-function paradigm where protein function is learned from the protein structure that is produced from the sequence. However, recent research has identified a large family of the intrinsically disordered proteins that lack a stable structure under physiological conditions and which therefore cannot be characterized using the structure-based approaches. These proteins are particularly abundant in the eukaryotes and are involved in the pathogenesis of numerous human diseases. The discovery of the intrinsically disordered proteins has prompted the development of a new generation of computational methods that predict presence of intrinsic disorder directly from protein sequences. A recently completed Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment has shown that these methods are fast and provide accurate results. However, while intrinsic disorder can be readily and accurately identified in protein sequences, its function remains a mystery. This proposal will conceptualize, design, implement, test and deploy an innovative machine learning method that provides highly accurate and integrated predictions of disorder and disorder functions directly from protein sequences. The team will utilize this method to produce functional annotations of disorder on an unprecedented scale of dozens of millions of proteins, addressing the knowledge gap problem for this protein family. In the long run this project will advance understanding of fundamental biological processes and related human health issues in the context of the intrinsically disordered proteins. 
This project will also train STEM students and researchers via high-school outreach and multidisciplinary teaching and mentoring of undergraduate and graduate students and postdoctoral researchers, producing highly skilled researchers who are sought after by industry and academia.

High-throughput annotation of cellular functions of intrinsic disorder in proteins

NSF

2016-10-01

One of the fundamental problems in molecular biology is to decipher functions of millions of uncharacterized protein sequences that are rapidly generated by high-throughput genome sequencing. The sequence-to-structure-to-function paradigm was used for decades to determine functions of proteins. However, recent research has broadened this paradigm by adding new players, proteins with intrinsic disorder (ID). They are highly abundant and cannot be solved with the currently used structure-driven approach. While there are many widely used computational methods that accurately predict ID in protein sequences, methods for the prediction of the many functions of ID are lacking. This project will develop a family of novel, accurate, and high-throughput computational methods that predict all major functions of ID in protein sequences. It will produce putative functional annotations on an unprecedented scale of thousands of species, addressing the problem of high rate acquisition of raw sequence data and contributing to the increase of the rate of scientific discovery. These results will advance our understanding of fundamental biological processes and human health given the high prevalence of ID in human diseases and attractiveness of proteins with ID as drug targets.

High-throughput characterization, prediction, and applications of protein disorder

NSERC

2012-03-01

For years, scientists were convinced that proteins must fold into precise, rigid molecules to allow proteins to function correctly. This view is changing now. The intrinsically disordered proteins have at least some disordered (also called unfolded/highly flexible) parts and many of them carry out their function without ever fully folding into a rigid molecule. The disorder is highly abundant in nature and its prevalence was shown in several human diseases. However, the characterization of protein disorder is lagging behind the rapidly growing number of known proteins. Experimental annotations of disorder are time consuming and difficult and thus computational methods that predict disorder from protein sequences have emerged as a viable alternative to bridge the annotation gap and to investigate the disorder. Although the quality of these predictors continues to rise, more accurate methods and novel methods that address specific characteristics of disorder are urgently needed. Moreover, there is a pressing need to understand and characterize disorder in various proteomes and functional classes of proteins. To this end, our objectives include (1) development of a comprehensive computational platform for accurate, fast, and multi-objective prediction of disorder; and (2) applications and experimental validation of disorder predictions. This work facilitates a more complete understanding of the protein disorder, principles of protein folding, and molecular mechanisms of protein function. Our methods provide a cost and time effective solution to guide experimentalists, and they are crucial for modern research and development in several areas, including rational drug design, structural genomics, and systems biology.

Early prediction of patient-related and radiological outcomes in patients with recent-onset inflammatory polyarthritis (EPA) using established and novel independent predictors

CIHR

2011-04-01

Early inflammatory polyarthritis (EPA) describes recent-onset disease with signs of inflammation in at least 3 peripheral joints, typically starts between 40 and 55 years of age, affects up to 5% of adults over their lifetime, and results in persistent inflammatory arthritis in close to 2% (30-50% of EPA). EPA patients are clinically very similar at onset, and their prognosis remains ill-defined and frequently poor, despite the availability of effective medications and the use of remission-targeted strategies. The lack at baseline of effective prognostic markers to identify patients needing these interventions is in part responsible for missing the window of opportunity for treatment in many patients. Based on previous observations that individuals segregate into poor and good in vitro activators of bone cells called osteoclasts (OC), we propose to identify characteristics of OC precursors and of OCs formed in vitro from patients' blood cells to correlate these characteristics with severe joint damage. As RA patients have short-for-age telomeres (i.e. DNA sequences at the ends of chromosomes), we propose to determine whether short telomeres at baseline (and rapidly shortening telomeres soon thereafter) are independent predictors of severe RA-like disease in EPA patients. We will also define the role of ultrasound joint evaluation in patients who do not have bone erosions on X-rays to predict which ones will develop severe joint damage. Finally, we will look at variants of immune-related genes and at the psychosocial characteristics (e.g. depression, coping strategies, pain perception) that may predict poor pain improvement and poor outcomes. The combination of these prognostic markers will lead to a prognostic tool that may guide early treatment (both biomedical and psychosocial) targeted to those patients most likely to benefit (cost-saving) and avoid unnecessary exposure to expensive and potentially toxic drugs when these are not needed.

Courses

CMSC 435 Introduction to Data Science

Virginia Commonwealth University

CMSC 635 Knowledge Discovery and Data Mining

Virginia Commonwealth University

ECE 321 Software Requirements Engineering

University of Alberta

Selected Articles

Intrinsic Disorder in Human RNA-Binding Proteins

Journal of Molecular Biology

2021-10-15

Although RNA-binding proteins (RBPs) are known to be enriched in intrinsic disorder, no previous analysis focused on RBPs interacting with specific RNA types. We fill this gap with a comprehensive analysis of the putative disorder in RBPs binding to six common RNA types: messenger RNA (mRNA), transfer RNA (tRNA), small nuclear RNA (snRNA), non-coding RNA (ncRNA), ribosomal RNA (rRNA), and internal ribosome RNA (irRNA). We also analyze the amount of putative intrinsic disorder in the RNA-binding domains (RBDs) and non-RNA-binding-domain regions (non-RBD regions). Consistent with previous studies, we show that in comparison with human proteome, RBPs are significantly enriched in disorder. However, closer examination finds significant enrichment in predicted disorder for the mRNA-, rRNA- and snRNA-binding proteins, while the proteins that interact with ncRNA and irRNA are not enriched in disorder, and the tRNA-binding proteins are significantly depleted in disorder. We show a consistent pattern of significant disorder enrichment in the non-RBD regions coupled with low levels of disorder in RBDs, which suggests that disorder is relatively rarely utilized in the RNA-binding regions. Our analysis of the non-RBD regions suggests that disorder harbors posttranslational modification sites and is involved in the putative interactions with DNA. Importantly, we utilize experimental data from DisProt and independent data from Pfam to validate the above observations that rely on the disorder predictions. This study provides new insights into the distribution of disorder across proteins that bind different RNA types and the functional role of disorder in the regions where it is enriched.

flDPnn: Accurate intrinsic disorder prediction with putative propensities of disorder functions

Nature Communications

2021-07-21

Identification of intrinsic disorder in proteins relies in large part on computational predictors, which demands that their accuracy should be high. Since intrinsic disorder carries out a broad range of cellular functions, it is desirable to couple the disorder and disorder function predictions. We report a computational tool, flDPnn, that provides accurate, fast and comprehensive disorder and disorder function predictions from protein sequences. The recent Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment and results on other test datasets demonstrate that flDPnn offers accurate predictions of disorder, fully disordered proteins and four common disorder functions. These predictions are substantially better than the results of the existing disorder predictors and methods that predict functions of disorder. Ablation tests reveal that the high predictive performance stems from innovative ways used in flDPnn to derive sequence profiles and encode inputs. flDPnn’s webserver is available at http://biomine.cs.vcu.edu/servers/flDPnn/

DescribePROT: database of amino acid-level protein structure and function predictions

Nucleic Acids Research

2020-10-29

We present DescribePROT, the database of predicted amino acid-level descriptors of structure and function of proteins. DescribePROT delivers a comprehensive collection of 13 complementary descriptors predicted using 10 popular and accurate algorithms for 83 complete proteomes that cover key model organisms. The current version includes 7.8 billion predictions for close to 600 million amino acids in 1.4 million proteins. The descriptors encompass sequence conservation, position specific scoring matrix, secondary structure, solvent accessibility, intrinsic disorder, disordered linkers, signal peptides, MoRFs and interactions with proteins, DNA and RNAs. Users can search DescribePROT by the amino acid sequence and the UniProt accession number and entry name. The pre-computed results are made available instantaneously. The predictions can be accessed via an interactive graphical interface that allows simultaneous analysis of multiple descriptors and can also be downloaded in structured formats at the protein, proteome and whole database scale. The putative annotations included by DescribePROT are useful for a broad range of studies, including: investigations of protein function, applied projects focusing on therapeutics and diseases, and in the development of predictors for other protein sequence descriptors. Future releases will expand the coverage of DescribePROT. DescribePROT can be accessed at http://biomine.cs.vcu.edu/servers/DESCRIBEPROT/.

Resilience of death: intrinsic disorder in proteins involved in the programmed cell death

Cell Death and Differentiation

2013-06-14

It is recognized now that intrinsically disordered proteins (IDPs), which do not have unique 3D structures as a whole or in noticeable parts, constitute a significant fraction of any given proteome. IDPs are characterized by an astonishing structural and functional diversity that defines their ability to be universal regulators of various cellular pathways. Programmed cell death (PCD) is one of the most intricate cellular processes where the cell uses specialized cellular machinery and intracellular programs to kill itself. This cell-suicide mechanism enables metazoans to control cell numbers and to eliminate cells that threaten the animal’s survival. PCD includes several specific modules, such as apoptosis, autophagy, and programmed necrosis (necroptosis). These modules are not only tightly regulated but also intimately interconnected and are jointly controlled via a complex set of protein–protein interactions. To understand the role of the intrinsic disorder in controlling and regulating the PCD, several large sets of PCD-related proteins across 28 species were analyzed using a wide array of modern bioinformatics tools. This study indicates that the intrinsic disorder phenomenon has to be taken into consideration to generate a complete picture of the interconnected processes, pathways, and modules that determine the essence of the PCD. We demonstrate that proteins involved in regulation and execution of PCD possess substantial amount of intrinsic disorder. We annotate functional roles of disorder across and within apoptosis, autophagy, and necroptosis processes. Disordered regions are shown to be implemented in a number of crucial functions, such as protein–protein interactions, interactions with other partners including nucleic acids and other ligands, are enriched in post-translational modification sites, and are characterized by specific evolutionary patterns. 
We mapped the disorder into an integrated network of PCD pathways and into the interactomes of selected proteins that are involved in the p53-mediated apoptotic signaling pathway.

Accomplishments

Fellow of the American Institute for Medical and Biological Engineering (AIMBE)

Senior Member of ACM

Gold Medal of Stanislaw Staszic

Outstanding Graduate Student Award

Media Appearances

Bioinformatics computer programs help biologists understand intrinsically disordered proteins

Unravelling the Complexity of Proteins

Crystallography for Complete Proteomes

Zeroing In...

Research Grants

Molecular-level prediction and mitigation of side effects of tubulin-targeting cancer therapy drugs

Computational intelligence based platform for prediction and characterization of binding sites in proteins

Role of osteoclastogenesis and osteoclast activation in joint destruction in degenerative and inflammatory joint diseases

Courses

ENCMP 100 Computer Programming for Engineers

EE 280 Introduction to Digital Logic Design

CMPE 310 Applying Software Engineering Practices Project

ECE 625 Data Analysis and Knowledge Discovery

ECE 625 Advanced Data Analysis and Decision Making

CSC 4811 Computer Security

CSC 5728 Software Engineering

Selected Articles

IDPology of the living cell: intrinsic disorder in the subcellular compartments of the human cell

DEPICTER: Intrinsic Disorder and Disorder Function Prediction Server

Taxonomic Landscape of the Dark Proteomes: Whole-Proteome Scale Interplay Between Structural Darkness, Intrinsic Disorder, and Crystallization Propensity

DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues

Disordered Nucleiome: Abundance of Intrinsic Disorder in the DNA- and RNA-binding Proteins in 1121 Species from Eukaryota, Bacteria and Archaea

Molecular Recognition Features (MoRFs) in Three Domains of Life

PDID: Database of Molecular-level Putative Protein-drug Interactions in the Structural Human Proteome

A Comprehensive Comparative Review of Sequence Based Predictors of DNA and RNA Binding Residues

High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder

Comprehensive overview and assessment of computational prediction of microRNA targets in animals

Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life

Human structural proteome-wide characterization of Cyclosporine A targets

Interplay Between the Oxidoreductase PDIA6 and microRNA-322 Controls the Response to Disrupted Endoplasmic Reticulum Calcium Homeostasis

Disordered Proteinaceous Machines
