Annotation of Plant Genome: A Case Study of Oryza sativa

  • Harpreet Kaur
  • Rajan Keshri
  • Tanzeel Tufail Mir
Keywords: senescence, interaction, signals, plant genome annotation, plant ontologies, plant gene family data bases, genome annotation pipelines, Functional annotation, Annotation Repetitive Sequence Functional Description

Abstract

Rice! A perennial claim crop of the world. Besides satisfying the eager of energy rice, has also been known to support worlds trade economy. Hence, being a crop of such crucial importance its examinational study at genome level will serve in multiplying its production and quality to irrigate the burning crave of humanity.  Likewise, the senescence gene of rice is responsible for its age duration. Hence, understanding its property at 360° will help us to modify or to alter its function in positive portion.

Using Insilco analysis mode, the present study is an attempt to examine various characteristics conformation of senescence causing gene in rice. The two gene chosen were HCP and RR because, the interaction in between these two led to the onset of senescence in rice. Two gene that is HCP (Histidine-containing phosphotransfer protein 1) and RR (Two-component response regulator) are responsible for attaining the stage of senescence in rice. Understanding their molecular and structural property will be going to let us closer to perform successful adjustments. Moreover, their specific property is also responsible for their specific interaction which led to generation of such signals that triggers senescence. Therefore, this analysis was aimed to understand the features of the two genes as well as their interaction by the means of computational technique.

Understanding the features, function and flow of gene will lead us to stabilized effective measure in order to get a beneficiary outcome while going for alteration in its characters. As the pure data for the structure conformation of the selected genes are not available so, we have at first, searched the most similar homolog of the query sequence and the search was based on similar sequence homology on the platform of local alignment tool. And further analysis was carried out on the base conformation of the most relevant homologs (structure/sequence) found.

We have analyze the query gene sequence by various dry lab analysis tool to explore its structural and molecular features with the motive to contribute a little knowledge for the sake of further studies to delay senescence in rice plant in order to increase grain productivity.

Downloads

Download data is not yet available.

References

Altschul, S. F., Gish, W., Miller, W., Myers, E. W., & Lipman, D. J. (1990). Basic local alignment search tool. Journal of Molecular Biology, 215(3), 403–410. https://doi.org/10.1016/S0022-2836(05)80360-2

Gish, W., & States, D. J. (1993). Identification of protein coding regions by database similarity search. Nature Genetics, 3(3), 266–272. https://doi.org/10.1038/ng0393-266

Madden, T. L., Tatusov, R. L., & Zhang, J. (1996). [9] Applications of network BLAST server. In Methods in Enzymology (Vol. 266, pp. 131–141). Elsevier. https://doi.org/10.1016/S0076-6879(96)66011-X

Altschul, S. (1997). Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Research, 25(17), 3389–3402. https://doi.org/10.1093/nar/25.17.3389

Zhang, Z., Schwartz, S., Wagner, L., & Miller, W. (2000). A Greedy Algorithm for Aligning DNA Sequences. Journal of Computational Biology, 7(1–2), 203–214. https://doi.org/10.1089/10665270050081478

Zhang, J., & Madden, T. L. (1997). PowerBLAST: A New Network BLAST Application for Interactive or Automated Sequence Analysis and Annotation. Genome Research, 7(6), 649–656. https://doi.org/10.1101/gr.7.6.649

Morgulis, A., Coulouris, G., Raytselis, Y., Madden, T. L., Agarwala, R., & Schäffer, A. A. (2008). Database indexing for production MegaBLAST searches. Bioinformatics, 24(16), 1757–1764. https://doi.org/10.1093/bioinformatics/btn322

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., & Madden, T. L. (2009). BLAST+: Architecture and applications. BMC Bioinformatics, 10(1), 421. https://doi.org/10.1186/1471-2105-10-421

Boratyn, G. M., Schäffer, A. A., Agarwala, R., Altschul, S. F., Lipman, D. J., & Madden, T. L. (2012). Domain enhanced lookup time accelerated BLAST. Biology Direct, 7(1), 12. https://doi.org/10.1186/1745-6150-7-12

Larkin, M. A., Blackshields, G., Brown, N. P., Chenna, R., McGettigan, P. A., McWilliam, H., Valentin, F., Wallace, I. M., Wilm, A., Lopez, R., Thompson, J. D., Gibson, T. J., & Higgins, D. G. (2007). Clustal W and Clustal X version 2.0. Bioinformatics, 23(21), 2947–2948. https://doi.org/10.1093/bioinformatics/btm404

Goujon, M., McWilliam, H., Li, W., Valentin, F., Squizzato, S., Paern, J., & Lopez, R. (2010). A new bioinformatics analysis tools framework at EMBL-EBI. Nucleic Acids Research, 38(Web Server), W695–W699. https://doi.org/10.1093/nar/gkq313

Latysheva, N. S., & Babu, M. M. (2016). Discovering and understanding oncogenic gene fusions through data intensive computational approaches. Nucleic Acids Research, 44(10), 4487–4503. https://doi.org/10.1093/nar/gkw282

Dereeper, A., Guignon, V., Blanc, G., Audic, S., Buffet, S., Chevenet, F., Dufayard, J.-F., Guindon, S., Lefort, V., Lescot, M., Claverie, J.-M., & Gascuel, O. (2008). Phylogeny.fr: Robust phylogenetic analysis for the non-specialist. Nucleic Acids Research, 36(Web Server), W465–W469. https://doi.org/10.1093/nar/gkn180

Dereeper, A., Audic, S., Claverie, J.-M., & Blanc, G. (2010). BLAST-EXPLORER helps you building datasets for phylogenetic analysis. BMC Evolutionary Biology, 10(1), 8. https://doi.org/10.1186/1471-2148-10-8

Biasini, M., Bienert, S., Waterhouse, A., Arnold, K., Studer, G., Schmidt, T., Kiefer, F., Cassarino, T. G., Bertoni, M., Bordoli, L., & Schwede, T. (2014). SWISS-MODEL: Modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Research, 42(W1), W252–W258. https://doi.org/10.1093/nar/gku340

Bienert, S., Waterhouse, A., de Beer, T. A. P., Tauriello, G., Studer, G., Bordoli, L., & Schwede, T. (2017). The SWISS-MODEL Repository—New features and functionality. Nucleic Acids Research, 45(D1), D313–D319. https://doi.org/10.1093/nar/gkw1132

Guex, N., Peitsch, M. C., & Schwede, T. (2009). Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: A historical perspective. ELECTROPHORESIS, 30(S1), S162–S173. https://doi.org/10.1002/elps.200900140

Benkert, P., Biasini, M., & Schwede, T. (2011). Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics, 27(3), 343–350. https://doi.org/10.1093/bioinformatics/btq662

Bertoni, M., Kiefer, F., Biasini, M., Bordoli, L., & Schwede, T. (2017). Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology. Scientific Reports, 7(1), 10480. https://doi.org/10.1038/s41598-017-09654-8

Gaudet, P., Livstone, M. S., Lewis, S. E., & Thomas, P. D. (2011). Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium. Briefings in Bioinformatics, 12(5), 449–462. https://doi.org/10.1093/bib/bbr042

GARNIER J. (1998). GOR secondary structure prediction method version IV. Meth. Enzym., R.F. Doolittle Ed., 266, 540-553.

Kloczkowski, A., Ting, K.-L., Jernigan, R. L., & Garnier, J. (2002). Protein secondary structure prediction based on the GOR algorithm incorporating multiple sequence alignment information. Polymer, 43(2), 441–449. https://doi.org/10.1016/S0032-3861(01)00425-6

T L Bailey and C Elkan. (1994). Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol., 2, 28-36.

Sun, L., Zhang, Q., Wu, J., Zhang, L., Jiao, X., Zhang, S., Zhang, Z., Sun, D., Lu, T., & Sun, Y. (2014). Two Rice Authentic Histidine Phosphotransfer Proteins, OsAHP1 and OsAHP2, Mediate Cytokinin Signaling and Stress Responses in Rice. Plant Physiology, 165(1), 335–345. https://doi.org/10.1104/pp.113.232629

Sakai, H. (2001). ARR1, a Transcription Factor for Genes Immediately Responsive to Cytokinins. Science, 294(5546), 1519–1521. https://doi.org/10.1126/science.1065201

Schrodinger, L.L.C., 2017. The PyMol molecular graphics system, (v2.0). Schrödinger, LLC, NEW YORK,

Szklarczyk, D., Morris, J. H., Cook, H., Kuhn, M., Wyder, S., Simonovic, M., Santos, A., Doncheva, N. T., Roth, A., Bork, P., Jensen, L. J., & von Mering, C. (2017). The STRING database in 2017: Quality-controlled protein–protein association networks, made broadly accessible. Nucleic Acids Research, 45(D1), D362–D368. https://doi.org/10.1093/nar/gkw937

Szklarczyk, D., Franceschini, A., Wyder, S., Forslund, K., Heller, D., Huerta-Cepas, J., Simonovic, M., Roth, A., Santos, A., Tsafou, K. P., Kuhn, M., Bork, P., Jensen, L. J., & von Mering, C. (2015). STRING v10: Protein–protein interaction networks, integrated over the tree of life. Nucleic Acids Research, 43(D1), D447–D452. https://doi.org/10.1093/nar/gku1003

Chen, X., Yang, J.-R., Guan, N.-N., & Li, J.-Q. (2018). GRMDA: Graph Regression for MiRNA-Disease Association Prediction. Frontiers in Physiology, 9, 92. https://doi.org/10.3389/fphys.2018.00092

Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., Lin, J., Minguez, P., Bork, P., von Mering, C., & Jensen, L. J. (2012). STRING v9.1: Protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Research, 41(D1), D808–D815. https://doi.org/10.1093/nar/gks1094

Sukhwal, A., & Sowdhamini, R. (2013). Oligomerisation status and evolutionary conservation of interfaces of protein structural domain superfamilies. Molecular BioSystems, 9(7), 1652. https://doi.org/10.1039/c3mb25484d

Published
2020-09-01
How to Cite
Harpreet Kaur, Rajan Keshri, & Tanzeel Tufail Mir. (2020). Annotation of Plant Genome: A Case Study of Oryza sativa. International Journal for Research in Applied Sciences and Biotechnology, 7(5), 12-41. https://doi.org/10.31033/ijrasb.7.5.3