Nita Parekh

Biological Network Analysis - Using Graph Theoretic Approaches

Introduction

Graph theory is a branch of discrete mathematics applied to the study of various real-world networks and their properties including biological networks. In our group we explore various graph properties such as centrality measures, modularity, eigen spectra, etc. for the analysis of biological networks, viz., protein contact networks (residue-residue interaction networks), co-expression gene networks and metabolic networks.

Protein Structure Analysis

Understanding Plant Stress Using Network-based Approaches

Rice is an economically important food crop both in India as well as the rest of the world. Currently, the production of food crops is hampered by various stress conditions such as abiotic (e.g., drought, salinity, cold, etc.) and biotic (e.g., weeds, insects and plant pathogens) stress. Stress perceptions are translated into a cascade of molecular events involving a network of transcription factors and other early stress-responsive genes. With the availability of a large number of high-throughput data at the transcriptome, proteome and metabolome level, we are interested in a systems-level understanding of stress-response. Using such data and various bioinformatic resources focused on plants, one can study the relevant correlations at various levels.

We are currently interested in the construction and analysis of stress-specific co-expression networks using microarray and RNA-seq data. As crops are exposed to different stress conditions in the field environment, it will be interesting to identify the gene signatures and processes which are unique or shared across the various stresses. With over 50% of genes in rice lacking annotation for biological processes, condition-dependent co-expression networks of rice can be helpful in the functional annotations of uncharacterized genes. Also, conserved network-neighbourhood with model species, viz., Arabidopsis can be used to identify evolutionarily conserved processes.

To aid in systems-level studies, we are in the process of integrating stress-specific transcriptomic networks with protein-protein interactions and metabolic pathway information for rice. This resource is being built using the Neo4j framework (highly scalable native graph database) and will aid in functional analysis and network visualization in rice research.

Related Publications:
1. Meta-analysis of Drought-tolerant Genotypes in Oryza sativa: A Network-based Approach, Sanchari Sircar and Nita Parekh, (submitted, in review).
2. Protocol for Co-expression Network Construction and Stress-responsive Expression Analysis in Brachypodium, Sanchari Sircar and Nita Parekh, in Methods in Molecular Biology – Springer (in press).
3. Functional characterization of drought-responsive modules and genes in Oryza sativa: a network-based approach, Sanchari Sircar and Nita Parekh, Front. Genet. 6: 256 (2015). link

Presentations in Conferences/Workshops:
1. Meta-analytic Study of Drought-Tolerant Rice Genotypes: A Systems-based Approach, Sanchari Sircar and Nita Parekh, poster presentation in the International Conference on Statistics & Big Data Bioinformatics, 20-23 Nov, ICRISAT, Hyderabad (2016).
2. Meta-analysis of Drought-Tolerant Rice Genotypes: A Network-based Approach, Sanchari Sircar and Nita Parekh, oral presentation at the 7th Edition of YRLS Conference, 18 - 20 May, Institut Pasteur, Paris, France (2016).
3. Co-Expression Network Analysis of Rice under Drought Stress: Identifying Functional Modules and Genes, Sanchari Sircar and Nita Parekh, poster presentation in the Indo-French seminar on ''Women in Science'' through CEFIPRA, 3-5 Feb, IISC, Bangalore (2015).
4. Gene Co-expression Network Analysis of Oryza sativa under Abiotic Stress, Sanchari Sircar and Nita Parekh, poster presentation in the Symposium on Accelerating Biology, 18-20 Feb, CDAC, Pune (2014).

Genomic Variants Identification and Analysis using NGS Data

Various studies have shown the association of genomic variants to rare and genetic diseases, including cancer, by inducing functional changes in genes and regulatory regions. These variants include sequence variants, viz., SNVs and small indels and structural variants (SVs), viz., duplications, deletions, inversions and translocations. With the advent of next generation sequencing technology, there is now considerable interest in understanding the role of genomic variants in the underlying molecular mechanisms in pathogenesis. Copy-number variations (CNVs) are a form of SVs that lead to abnormal copies of large genomic regions (50 bp - 1 Mbp) in a cell. The importance of CNVs is recognized by their high prevalence in human genome (~10%) and the observation that approximately half of the reported CNVs overlap with protein-coding genes. Single nucleotide variations (SNVs) and small indels (~ 2-50 bp) are known to play an equally significant role in influencing the human trait and contribute to disease. In our group we are interested in the identification and analysis of these genomic variants. We have developed an integrated pipeline with a modular framework, SVINGS (link), for the identification and analysis of CNVs using NGS data. We are in the process of developing separate modules for the detection of small indels and SNVs, which will be integrated in SVINGS. We are currently investigating the role of the genomic variants in cancer pathogenesis and their potential in identifying biomarkers.

Related Publications:
1. Copy Number Variation Detection Workflow using Next Generation Sequencing Data, Prashanthi Dharanipragada and Nita Parekh, in IEEE proceedings of International Conference on Bioinformatics and Systems Biology (BSB-2016) 4-6 March 2016, IIIT Allahabad. link

Presentations in Conferences/Workshops:
1. SVINGS: Structural Variants Identification in Next Generation Sequence data, Prashanthi Dharanipragada, Sriharsha Vogeti and Nita Parekh, oral presentation at 8th Edition of Young Researchers in Life Science (YRLS) Conference, 15-17 May, Institut Imagine, Paris, France (2017).
2. Copy Number Variation Detection Workflow using Next Generation Sequencing Data, Prashanthi Dharanipragada and Nita Parekh, oral presentation in International Conference on Bioinformatics and Systems Biology (BSB-2016) on 4th-6th March 2016, IIIT, Allahabad.
3. Functional Analysis of Copy Number Variations in DLBCL Pathogenesis, Prashanthi Dharanipragada and Nita Parekh, poster presented in Accelerating Biology 2016 - Decoding the Deluge, January 19-21, CDAC Pune (2016).
4. Detection of Copy Number Variations from Next Generation Sequencing Data, Sriharsha Vogeti, Prashanthi Dharanipragada, Anwesha Mohapatra, Shanta Pendkar and Nita Parekh, poster presentation in International Conference on Systems Biology (ICSB), Nov 23-24, Biopolis, Singapore (2015).
5. Copy Number Variation Analysis of Diffuse Large B-Cell Lymphoma (DLBCL) Subtypes, Prashanthi Dharanipragada and Nita Parekh, poster presentation at 19th International Conference on Research in Computational Molecular Biology (RECOMB), 12-15 April, Warsaw, Poland (2015).
6. Copy Number Variation Analysis of Diffuse Large B-Cell Lymphoma (DLBCL) Subtypes, Prashanthi Dharanipragada and Nita Parekh, poster presentation at Big Data Analysis and Translation in Disease Biology, Indo-US Bilateral Conference-cum-Workshop, 18-22 January, JNU, New Delhi (2015).

Metabolomics

The metabolome represents the collection of all metabolites in a biological cell, tissue, organ or organism that are the end products of cellular processes and a systematic study of these small molecules is called metabolomics. We have developed an integrated web-based platform, Computational Core of Plant Metabolomics (CCPM) that provides data repository, analysis and visualization of mass spectral data. It provides an end-to-end analysis of LC/GC-MS data involving raw data capture, data pre-processing, data pre-treatment, statistical and pathway analysis, with option for customization of parameters from the web interface.

The metabolic network, a complex network including all metabolites and enzyme catalyzed reactions occurring within a living cell, as well as the interactions between the reactants and enzymes, is an abstract representation of cellular metabolism. The topology of metabolic networks reflects the dynamics of their formation and evolution and graph theory have proved to be useful in such analysis. Graph centrality measures are useful in identifying important metabolites and enzymes and modularity measures to identify pathways conserved over evolution. We are presently carrying out graph-based analysis of substrate-centric and enzyme-centric metabolic networks of Arabidopsis thaliana.

Related Publications:
1. Construction and Analysis of Enzyme Centric Network of A. thaliana using Graph Theory, Kasthuribai Viswanathan and Nita Parekh, in A. N. Averkin, D. I. Ignatov, S. Mitra, J. Poelmans,V. B. Tarasov (Eds.), proceedings of International Workshop on Soft Computing Applications and Knowledge Discovery (SCAKD’11), 125-134, (2011). ISSN:1613-0073.link
2. Construction and Analysis of Metabolic Network of Arabidopsis thaliana Pathways, Kasthuribai Viswanathan and Nita Parekh, in proceedings of The 12th International Conference on Bioinformatics & Computational Biology (BIOCOMP'11), Ed. Hamid R. Arabnia, Quoc-Nam Tran, 367-372, (2011). ISBN:1-60132-172-4. link

Presentations in Conferences/Workshops:
1. CCPM V3.4: Towards Collaborative Metabolomics, I. Ghosh, A. Mitra, Nita Parekh, V. Pudi, B. Chakrabarty, P. Dharanipragada, R. Gurrapu, K. Narendra Babu, S. Manoj Kumar, S. R. Kiran Raj, M. Manoj Kumar, V.P. Srivani, V. Dharma Teja, poster presentation in 3rd International Plant Physiology Congress (IPPC-2015), Dec 11-14, JNU, New Delhi (2015).
2. CCPM V3.4: Towards Collaborative Metabolomics, I. Ghosh, A. Mitra, Nita Parekh, V. Pudi, B. Chakrabarty, P. Dharanipragada, R. Gurrapu, K. Narendra Babu, S. Manoj Kumar, S. R. Kiran Raj, M. Manoj Kumar, V.P. Srivani, V. Dharma Teja, poster presentation in 7th Annual Meeting of Proteomics Society - India (PSI), Dec 3-6, VIT University, Vellore (2015).
3. Comparative Analysis of Metabolic Networks, Shubhi Gupta and Nita Parekh, poster presentation at International Conference on Frontiers of Interface between Statistics and Sciences, CR Rao AIMSCS, Hyderabad, 30 Dec 2009-2 Jan 2010.