1.Carnobacterium Isolated from Caviar of Sturgeon (Acipenser ruthenus) Farmed in Korea.
Donguk KIM ; Kookhee KANG ; Haekyung CHEON ; Jisoon IM ; Kwisung PARK
Journal of Bacteriology and Virology 2015;45(2):151-154
Sturgeon aqua-cultured in Korea is mainly Acipenser ruthenus and its culture began in the early 2000's. In this study, Carnobacterium sp. was isolated from unprocessed caviar of aqua-cultured Acipenser ruthenus. The 16s rRNA nucleotide sequence obtained from Carnobacterium sp. isolate (accession no. KM236206) was deposited with GenBank and homologous with Carnobacterium divergens DSM 20623 and NBRC15683 strain. In conclusion, this is first report of isolation of Carnobacterium sp. from caviar of Acipenser ruthenus aqua-cultured in Korea. In the future, it must be ascertained whether Carnobacterium sp. degenerate of caviar or cause diseases in sturgeon.
Base Sequence
;
Carnobacterium*
;
Databases, Nucleic Acid
;
Korea
2.WinBioDBs: A Windows-based Integrated Program for Manipulating Major Biological Databases.
Hyeweon NAM ; Jin Ho LEE ; Kiejung PARK
Genomics & Informatics 2009;7(3):175-177
We have developed WinBioDBs with Windows interfaces, which include importing modules and searching interfaces for 10 major public databases such as GenBank, PIR, SwissProt, Pathway, EPD, ENZYME, REBASE, Prosite, Blocks, and Pfam. User databases can be constructed with searching results of queries and their entries can be edited. The program is a stand-alone database searching program on Windows PC. Database update features are supported by importing raw database files and indexing after downloading them. Users can adjust their own searching environments and report format and construct their own projects consisting of a combination of a local databases. WinBioDBs are implemented with VC++ and its database is based on MySQL.
Abstracting and Indexing as Topic
;
Databases, Nucleic Acid
;
Databases, Protein
3.BioStore: A Repository System for Registering and Distributing Public Biology Databases.
Hongseok TAE ; Jeong Min HAN ; Bu Young AHN ; Kiejung PARK
Genomics & Informatics 2009;7(1):49-51
Although abundant biology data have been accumulated in public biology databases, such as GenBank and PIR, few easy-interface services are provided for users to access or update them. We have developed a system, named BioStore, that is composed of several programs to aid users to not only access public data but also share their own data easily. The service can be used for maintaining a local database as a repository of raw data files of several public databases and distributing the data files to other users. Currently, BioStore manipulates major bio-databases and will expand to include more databases and more useful interfaces.
Biology
;
Databases, Nucleic Acid
;
Formycins
;
Ribonucleotides
;
Information Storage and Retrieval
4.An Efficient Approach to Mining Maximal Contiguous Frequent Patterns from Large DNA Sequence Databases.
Md Rezaul KARIM ; Md Mamunur RASHID ; Byeong Soo JEONG ; Ho Jin CHOI
Genomics & Informatics 2012;10(1):51-57
Mining interesting patterns from DNA sequences is one of the most challenging tasks in bioinformatics and computational biology. Maximal contiguous frequent patterns are preferable for expressing the function and structure of DNA sequences and hence can capture the common data characteristics among related sequences. Biologists are interested in finding frequent orderly arrangements of motifs that are responsible for similar expression of a group of genes. In order to reduce mining time and complexity, however, most existing sequence mining algorithms either focus on finding short DNA sequences or require explicit specification of sequence lengths in advance. The challenge is to find longer sequences without specifying sequence lengths in advance. In this paper, we propose an efficient approach to mining maximal contiguous frequent patterns from large DNA sequence datasets. The experimental results show that our proposed approach is memory-efficient and mines maximal contiguous frequent patterns within a reasonable time.
Base Sequence
;
Computational Biology
;
Databases, Nucleic Acid
;
DNA
;
Mining
5.A Bio-database Management System for the Monitoring and Automatic FTP of Public Databases.
Hongseok TAE ; Jeong Min HAN ; Bu Young AHN ; Kiejung PARK
Genomics & Informatics 2008;6(2):95-97
Many bioinformatics sites have managed local bio-databases, including major databases such as GenBank and PIR with update load. We have developed several programs to monitor the update status of these databases and to FTP them automatically. These programs can be used for maintaining local bio-databases as recent versions and providing up-to-date databases through FTP sites. Currently, the program serves major bio-databases and will extend to accommodate many more bio-databases.
Computational Biology
;
Databases, Nucleic Acid
;
Formycins
;
Organothiophosphorus Compounds
;
Ribonucleotides
6.The beta version of LongMan: a large-scale mammalian lncRNA database orthologous to human lncRNAs.
Xiao-Xue YANG ; Hai ZHANG ; Sha HE ; Jie LIN ; Hao ZHU
Journal of Southern Medical University 2016;36(4):552-556
OBJECTIVETo predict orthologous sequences of the GENCODE-identified 13 562 human long non-coding RNAs (lncRNA) in 16 mammalian genomes and construct a lncRNA database LongMan for lncRNA studies.
METHODSThe exon structures of a total of 13 562 human lncRNAs were analyzed using RNAfold, and their orthologous sequences were searched against 16 mammalian genomes using Infernal. The potential orthologous genes, transposons and splicing signals of human lncRNAs were predicted to construct a lncRNA database with a updating mechanism.
RESULTSand
CONCLUSIONThe lncRNA database LongMan we constructed, which currently contains 133 646 orthologous lncRNAs, provides information of the sequences, alignments, transposons, and species-specific insertions and deletions and allows database search on combinatorial conditions, graphic display and data download. As the first large-scale mammalian orthologous lncRNA database, LongMan has important values in future comparative and functional studies of lncRNAs.
Animals ; Databases, Nucleic Acid ; Exons ; Humans ; RNA, Long Noncoding
7.Unique Phylogenetic Lineage Found in the Fusarium-like Clade after Re-examining BCCM/IHEM Fungal Culture Collection Material.
David TRIEST ; Koen DE CREMER ; Denis PIÉRARD ; Marijke HENDRICKX
Mycobiology 2016;44(3):121-130
Recently, the Fusarium genus has been narrowed based upon phylogenetic analyses and a Fusarium-like clade was adopted. The few species of the Fusarium-like clade were moved to new, re-installed or existing genera or provisionally retained as "Fusarium." Only a limited number of reference strains and DNA marker sequences are available for this clade and not much is known about its actual species diversity. Here, we report six strains, preserved by the Belgian fungal culture collection BCCM/IHEM as a Fusarium species, that belong to the Fusarium-like clade. They showed a slow growth and produced pionnotes, typical morphological characteristics of many Fusarium-like species. Multilocus sequencing with comparative sequence analyses in GenBank and phylogenetic analyses, using reference sequences of type material, confirmed that they were indeed member of the Fusarium-like clade. One strain was identified as "Fusarium" ciliatum whereas another strain was identified as Fusicolla merismoides. The four remaining strains were shown to represent a unique phylogenetic lineage in the Fusarium-like clade and were also found morphologically distinct from other members of the Fusarium-like clade. Based upon phylogenetic considerations, a new genus, Pseudofusicolla gen. nov., and a new species, Pseudofusicolla belgica sp. nov., were installed for this lineage. A formal description is provided in this study. Additional sampling will be required to gather isolates other than the historical strains presented in the present study as well as to further reveal the actual species diversity in the Fusarium-like clade.
Databases, Nucleic Acid
;
Fusarium
;
Genetic Markers
;
Phylogeny
;
Sequence Analysis
8.Construction of nervous system relative protein and gene secondary database.
Pan WANG ; Xinhao CHEN ; Xiangming LIU
Journal of Biomedical Engineering 2007;24(5):1023-1026
Along with the rapid research of neural molecular biology, abundant data are produced so that the collection and coordination of high-throughout data about nervous system relative proteins and genes are imperative. Through analyzing the biological primary databases maintained by NCBI and RCSB as the main data source and designing a new data model, a local specialized secondary database is constructed, which mainly includes nucleotide sequences, protein sequences and protein structures, and is established on Sun Blade 2000 System and Oracle 9i. All programs are developed by Java technology. A method of web information automatic retrieval with XML is proposed for sequence data collection and submission to the database. JSP + JavaBean technology is used to support data promulgation on Internet. The establishment of this database provides an excellent platform for the research of neural molecular biology and the pathogenesis of related diseases.
Database Management Systems
;
Databases, Nucleic Acid
;
Databases, Protein
;
Gene Expression Profiling
;
Humans
;
Nervous System
;
Proteins
;
chemistry
9.Sequence analysis of the spike gene of Porcine epidemic diarrhea virus isolated from South China during 2011–2015.
Xiaoya ZHAO ; Zhili LI ; Xiduo ZENG ; Guanqun ZHANG ; Jianqiang NIU ; Baoli SUN ; Jingyun MA
Journal of Veterinary Science 2017;18(2):237-243
The spike gene of porcine epidemic diarrhea virus (PEDV) was sequenced from 55 South China field strains isolated from pigs with symptoms of diarrhea. The sequences were compared within the set of field strains as well as with reference strains available in GenBank. Within the 55 South China PEDV field strains, the deduced amino acid sequence identities ranged from 93.8% to 99.9 % and ranged from 90.7% to 99.5% when compared with the foreign reference strains in GenBank. Our phylogenetic analysis showed that 10 of the 55 South China PEDV strains belonged to G1b and 45 belonged to G2b.
Amino Acid Sequence
;
China*
;
Databases, Nucleic Acid
;
Diarrhea
;
Porcine epidemic diarrhea virus*
;
Sequence Analysis*
;
Swine
10.Non-amplification of an Allele of D8S1179 Locus Due to Point Mutation in Flanking Region.
Korean Journal of Legal Medicine 2000;24(1):33-42
AmpFlSTR Profiler PlusTM PCR amplification system is highly efficient commercial DNA profiling system that are used by many institute for forensic individual identification. During the population study of 128 true Korean families (626 persons) with 9 loci of AmpFlSTR Profiler PlusTM PCR amplification system, we found unusual homozygous genotype at the D8S1179 locus in 4 families that is hard to explain with the Mendel's hereditary principles. The genotyping results suggest possibility of presence of non-amplified alleles due to deletion of one allele or polymorphism of primer-binding site. So we designed a new pair of primers of D8S1179 locus from GenBank data (GenBank Accession No. G08710). The newly designed primers amplified alleles that are not amplified with AmpFlSTR Profiler PlusTM PCR amplification system and all homozygous genotypes are typed to heterozygous genotype. We sequenced alleles of family members who have non-amplified alleles and we found G to A transition at the position of 147th base of GenBank sequence.
Alleles*
;
Databases, Nucleic Acid
;
DNA Fingerprinting
;
Genotype
;
Humans
;
Point Mutation*
;
Polymerase Chain Reaction