We use high-performance computers to study the genome information of various organisms, with the aim of identifying new biological phenomena and understanding how organisms evolve. We also develop software tools for bioinformatics and molecular biology, and we operate a computer system for analyzing genome sequence data. This system is available to researchers at our university, and we hold training courses in genome analysis at least once a year.
(1) Large-scale analysis of genomes
Today, the complete genome sequences of more than 1,000 organisms are available. We are analyzing this enormous body of genome data using bioinformatics and molecular evolution techniques, and we are developing software and algorithms that facilitate large-scale genome analysis. These include CONSERV, a conserved sequence finder. When CONSERV was used to analyze the complete genome sequences of 266 organisms, it identified invariant sequences that may have been present in the last common ancestor of all extant life forms (Goto et al., 2007; Fig. 1). In addition, we are working to understand the evolutionary pathways of the influenza virus by comprehensively analyzing its genome (Fig. 2).
Figure 1. Large-scale genome analysis of 266 organisms revealed a sequence that is conserved in almost all known genomes.
Figure 2. Comprehensive analysis of the influenza virus genome.
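The idea of searching for sequences shared across many genomes can be illustrated with a minimal sketch. This is not the CONSERV algorithm, whose details are not described here. It simply intersects the sets of fixed-length substrings (k-mers) of each genome, and the function names and toy sequences are hypothetical.

```python
from typing import Iterable, Optional, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of seq (upper-cased)."""
    seq = seq.upper()
    # range() is empty when the sequence is shorter than k
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_kmers(genomes: Iterable[str], k: int) -> Set[str]:
    """Return the k-mers that occur in every genome (exact matches only)."""
    shared: Optional[Set[str]] = None
    for genome in genomes:
        current = kmers(genome, k)
        shared = current if shared is None else shared & current
        if not shared:
            break  # nothing can be common to all genomes any more
    return shared or set()


# Toy sequences invented for illustration, not real genome data
toy_genomes = ["ACGTTGCA", "TTACGTTG", "GGACGTTC"]
print(sorted(shared_kmers(toy_genomes, 5)))  # → ['ACGTT']
```

Holding every k-mer of a complete genome in memory does not scale to hundreds of organisms, so a real analysis would need indexed or streaming data structures, and probably tolerance for both strands and near-matches.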
(2) Next-generation sequencer data analysis
The recently developed "next-generation sequencing" technology has made it possible to sequence the entire genome of a microorganism in a single sequencer run. Each run therefore produces a massive amount of nucleotide sequence data. We are developing software that can handle these data, and we have set up an analysis system that is used for microorganism sequencing projects carried out in collaboration with other laboratories (Fig. 3).
Figure 3. Next-generation sequencing enables sequencing of an entire genome in one run.
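As a rough illustration of handling sequencer output, the sketch below streams reads and reports how many reads and bases a run contains. It assumes the reads are stored in the common four-line FASTQ format. It is not the software described above, and the function names and toy records are hypothetical.

```python
import io
from typing import Iterator, TextIO, Tuple


def read_fastq(handle: TextIO) -> Iterator[Tuple[str, str, str]]:
    """Yield (read_id, sequence, quality) from four-line FASTQ records."""
    while True:
        header = handle.readline()
        if not header:
            return  # end of input
        sequence = handle.readline().rstrip("\n")
        handle.readline()  # '+' separator line, ignored
        quality = handle.readline().rstrip("\n")
        yield header.rstrip("\n")[1:], sequence, quality


def summarize(handle: TextIO) -> Tuple[int, int]:
    """Return (number of reads, total number of bases) without loading all reads."""
    n_reads = 0
    n_bases = 0
    for _, sequence, _ in read_fastq(handle):
        n_reads += 1
        n_bases += len(sequence)
    return n_reads, n_bases


# Two toy records invented for illustration
toy_fastq = "@read1\nACGT\n+\nIIII\n@read2\nACGTAC\n+\nIIIIII\n"
print(summarize(io.StringIO(toy_fastq)))  # → (2, 10)
```

Reading one record at a time keeps memory use constant, which matters when a single run yields far more data than fits comfortably in memory.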
(3) Operation of a computer system for genome information analysis at Osaka University
We provide computer resources for researchers at our university. Our servers also provide mirrored access to major nucleotide, protein, and genome databases (Fig. 4). These mirrors are synchronized with the source servers and kept up to date.
Figure 4. Genome Information Research Center Computer System.