HuVarBase: for comprehensive data and deciphering diseases

Proteins are large, complex molecules that perform a vast array of functions. Along with nucleic acids, carbohydrates and lipids, they are one of the four major classes of biological molecules. The human body contains many different proteins, all of which help ensure survival and health. Useful data about these proteins are collected and stored in multiple databases, and scientists and clinicians use these data to understand and fight diseases. However, because the databases differ in scope, the data are widely scattered and lack a uniform structure.

Published in: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0210475
For more information, please write to gromiha@iitm.ac.in
Research: https://www.iitm.ac.in/bioinfo/Gromiha/Databases.html
Database: https://www.iitm.ac.in/bioinfo/huvarbase

Diseases and heritable disorders such as cancer and diabetes are linked to mutations in proteins. A mutation, in this context, is a change in the amino acid sequence of a protein. In clinical settings, a patient’s sequence data can be used to identify the presence of a particular mutation, which can inform the prognosis and help predict outcomes for particular modes of treatment. Sophisticated analyses of this kind require advanced and precise algorithms. Researchers and clinicians working towards customized medicine are currently creating mutation databases for use in predictive algorithms and other large-scale analyses. While a number of relevant databases are available, they usually do not provide additional details, such as how the mutation affects the protein. For example, existing databases do not give information about the protein’s location in the cell (“subcellular location”). Sometimes, the data found in one database may even conflict with the data found in another. A comprehensive database could mitigate these shortcomings.
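To make the idea of checking a patient’s sequence for a particular mutation concrete, here is a minimal Python sketch. It is an illustration only, not the method used by HuVarBase or by any clinical pipeline: the function names, the toy sequences and the simple "A4V"-style notation (reference amino acid, position, variant amino acid) are all assumptions made for this example.

```python
import re

# Matches simple substitutions written as <reference><position><variant>,
# e.g. "A4V". This notation is an assumption made for the sketch.
_SUBSTITUTION = re.compile(r"([A-Z])(\d+)([A-Z])")


def parse_substitution(mutation):
    """Split a label such as "A4V" into ("A", 4, "V")."""
    match = _SUBSTITUTION.fullmatch(mutation)
    if match is None:
        raise ValueError(f"unsupported mutation label: {mutation!r}")
    ref, pos, alt = match.groups()
    return ref, int(pos), alt


def carries_mutation(patient_sequence, mutation):
    """Return True if the patient's protein has the variant residue at that position."""
    _ref, pos, alt = parse_substitution(mutation)
    if not 1 <= pos <= len(patient_sequence):
        raise ValueError(f"position {pos} lies outside the sequence")
    # Positions in mutation labels are 1-based; Python strings are 0-based.
    return patient_sequence[pos - 1] == alt


# Toy sequences invented for illustration; they are not real protein data.
reference = "MKTAYIAK"
patient = "MKTVYIAK"

print(carries_mutation(patient, "A4V"))    # True
print(carries_mutation(reference, "A4V"))  # False
```

Real variant data are far messier (insertions, deletions, several numbering schemes), which is part of why curated databases are needed.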

Prof Michael Gromiha and his team at IITM have created several databases on mutations and their effects: MutHTP for disease-causing mutations in transmembrane proteins (proteins located in the membrane), CPAD for mutational effects on protein aggregation, ProTherm for protein stability and PROXiMATE for the binding affinity of protein-protein complexes. HuVarBase, one of the newest resources created by the team, contains data on mutations in human proteins, with comprehensive information at the gene and protein levels. The database is publicly available.

The first step in constructing HuVarBase was to go through the literature and existing variant databases to collect all the necessary data. Where data conflicted, the team went back to the original literature sources to decide which data to include. Additional features of each protein were then added: the protein sequence (i.e. the order of amino acids used to build the protein); the disease class (i.e. the type of disease, such as cardiovascular disease or skin disease); links to structural details of the protein; the protein’s location in the cell; and whether the protein undergoes any additional molecular changes (collectively termed “post-translational modifications”), among others.
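The collection step described above, in which records from several sources are combined and conflicting entries are set aside for a check against the original literature, can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the team’s actual pipeline: the record fields, the source labels ("db_a", "db_b") and the gene and mutation names are hypothetical placeholders.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class VariantRecord:
    gene: str            # placeholder gene name
    protein_change: str  # e.g. "A10V"
    disease_class: str   # e.g. "cardiovascular"
    source: str          # hypothetical label of the database the record came from


def merge_records(records):
    """Group records by variant and separate consistent entries from conflicts.

    Returns (merged, conflicts). A variant is 'merged' when every source agrees
    on its disease class; otherwise all of its records go to 'conflicts' so a
    curator can consult the original literature.
    """
    grouped = defaultdict(list)
    for record in records:
        grouped[(record.gene, record.protein_change)].append(record)

    merged, conflicts = {}, {}
    for key, group in grouped.items():
        disease_classes = {record.disease_class for record in group}
        if len(disease_classes) == 1:
            merged[key] = {
                "disease_class": disease_classes.pop(),
                # Keep every source so each entry stays traceable.
                "sources": sorted(record.source for record in group),
            }
        else:
            conflicts[key] = group
    return merged, conflicts


# Invented example records, for illustration only.
records = [
    VariantRecord("GENE1", "A10V", "cardiovascular", "db_a"),
    VariantRecord("GENE1", "A10V", "cardiovascular", "db_b"),
    VariantRecord("GENE2", "G5D", "skin", "db_a"),
    VariantRecord("GENE2", "G5D", "cancer", "db_b"),
]
merged, conflicts = merge_records(records)
print(merged)
print(sorted(conflicts))
```

Keeping the list of sources on every merged entry mirrors the traceability goal of the database; deciding which of two conflicting records is correct remains a manual curation task that code like this can only flag.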

The database is equipped with advanced search options, an easy-to-follow tutorial, FAQs and a glossary, making it accessible to anyone interested in the study of protein mutations and diseases. Ensuring the reliability of the data was one of the challenges the team faced. To keep the data accurate and traceable, the team has included links to the original sources of the data.

“In this genomic era, a large volume of disparate ‘omics’ data pile-up almost on a daily basis. The information content in such data sets are enormous. It is non-trivial to uncover biologically relevant and significant information from these massive data sets. One of the first requirements in such ventures is to be able to integrate the disparate data sets in a logical and effective way before querying the data to address a biological question. Gromiha and co-workers, in their development named HuVarBase, made a strategic integration of variant Omics datasets on humans. Though one talks about human genome sequence data as though it is absolute, there are variations between humans. Sometimes these variations have genetic basis leading to an explanation for the vulnerability of an individual human for example, to cancer, diabetes and like. Therefore HuVarBase should greatly aid our understanding of the molecular basis of a disease process. This forms the firm first steps in the pipeline of drug design and discovery. Use of development depends on the ease of using the web interface. HuVarBase has several useful features incorporated to enable user community to make a complex query quite easily.”
N. Srinivasan
Molecular Biophysics Unit
Indian Institute of Science, Bangalore


