Available Bioinformatics Tools for Epigenetics and Non-coding RNA
Received: 06-Mar-2021 Accepted Date: Mar 20, 2021 ; Published: 27-Mar-2021
Citation: Katakam AP 2021. Available Bioinformatics Tools for Epigenetics and Non-coding RNA. EJBI. 17 3 15
This open-access article is distributed under the terms of the Creative Commons Attribution Non-Commercial License (CC BY-NC) (http://creativecommons.org/licenses/by-nc/4.0/), which permits reuse, distribution and reproduction of the article, provided that the original work is properly cited and the reuse is restricted to noncommercial purposes. For commercial reuse, contact [email protected]
IntroductionModern study of living things/qualities of living things is seen as the generation of huge data sets which require a computer based treatment in order to produce useful information for them and their use is more important in biological research due to irreversible data- oriented trends in field, as metagenomics is form of putting DNA in correct order which DNA sequencing present in particular sample as the researcher are finding novel genes with encoded metagenomes it may get evolved in pharmaceutical industry, bioinformatics tools are for assembly and annotation of sequence data with development of new novel genes.
Bioinformatics Tool, Visual Representation, Phylogenetic Analysis, Artificial Intelligence, Neural network, Machine Learning
There are two ways to study micro biomes using highthroughput sequencing- marker-gene studies, whole genome shotgun (WGS) metagenomics. Marker-gene studies are designed to PCR amplifies particular genomes e.g.: bacteria, archea or fungi, the resulted product are sequenced this is fast and cheap method which isn’t used for gene encoded in part of Meta genomes that remained un-sequenced.
Tools used in bioinformatics
Sequencing Technologies for Whole Genome Shotgun Metagenomics: WGS-shotgun-metagenomics is an alternative complementary method metagenomics is application of sequencing technologies for genomic material present in microbiome of present sample. It also provide information about function, structure, organization of genes, novel genes identification and biocatalysts, advances in this sequencing technologies have provided hundreds of gigabases of DNA sequencing at very less cost and provide wide range of enzyme and biocatalyst applications in marketplace and biotechnology, pharmaceutical industry.
Many microbiomes are complex incredibly for these the sequencing technologies have enabled much deeper and second generation sequence have technologies like Illumina and Ion Torrent and third generation have Pac bio and ONT which have longer reads and not widely used as it have high rate of homopolymer.
Metagenomics assembly: DNA sequence fragment of genomes assembly is process of reconstruction in silico original genomes for smaller fragments, de novo assembly software tools uses main paradigms like OLC and de Brujin graph- it can construct graph by constructed without pairwise comparison and its most effective and less expensive than OLC approach.
Phylogenetic binning: Phylogenetic Binning is form of clustering genomes sequence into groups with each separate biological single genome in to group that separate taxon, from which single genome is assemble connecting diverse taxon, LikelyBin is unsupervised statistical binning metagenomics fragments, PHYSCIMM is composition of species presented in public data base.
Protein domain database: Number of large published protein structure/sequence/database is collaboration of twelve databases, INTERPRO integrate information about active sites, protein families and protein activity and functioning. It combines all these characterization of sequence theses are checked for link and original publications. InterProScan is protein function prediction software which gives an input query sequence in multiple data base, when no match then it is passed.
Targeted gene discovery: When there is small amount of protein then no need to go through whole database metagenomics gene, XANDER is a gene targeted assembler uses HMMs for guiding graphs traversal; less compute intensive due to small amount of data is used.
Pathway Databases: It refer to series of action between the biomolecule with particular products, Reactome is curated database with biological pathway and with single reaction in lowest level. Kyoto Encyclopaedia of gene and genomes is phenotypically information for which it consist multiple data base, information about protein, enzyme, genomes diseases and drug pathways; different links are available like NCBI, OMIM, and Uni Prot.