1.Germline DNA-Repair Genes and HOXB13Mutations in Korean Men with Metastatic Prostate Cancer: Data from a Large Korean Cohort
Sang Hun SONG ; Hak-Min KIM ; Yu Jin JUNG ; Ha Rim KOOK ; Sungwon JEON ; Jong BHAK ; Jin Hyuck KIM ; Hakmin LEE ; Jong Jin OH ; Sangchul LEE ; Sung Kyu HONG ; Seok-Soo BYUN
The World Journal of Men's Health 2023;41(4):960-968
Purpose:
Germline mutations in DNA damage repair (DDR) genes such as BRCA2 have been associated with prostate cancer (PC) risk but has not been thoroughly evaluated for metastatic prostate cancer (mPC) in Asian men. This study attempts to evaluate frequency of DDR mutations in the largest cohort of Koreans.
Materials and Methods:
We recruited 340 patients with mPC unselected for family history of cancer and compared to 495 controls. Whole genome sequencing was applied to assess germline pathogenic/likely pathogenic variants (PV/LPVs) in 26 DDR genes and HOXB13, including 7 genes (ATM, BRCA1/2, CHEK2, BRIP1, PALB2, and NBN) associated with hereditary PC. Comparisons to published Caucasian and Japanese cohorts were performed.
Results:
Total of 28 PV/LPVs were identified in 30 (8.8%) patients; mutations were found in 13 genes, including BRCA2 (15 men [4.41%]), ATM (2 men [0.59%]), NBN (2 men [0.59%], and BRIP1 (2 men [0.59%]). Only one patient had HOXB13 mutation (0.29%). A lower rate of overall germline variant frequency was observed in Korean mPC compared to Caucasians (8.8% vs. 11.8%), but individual variants notably differed from Caucasian and geographically similar Japanese cohorts. PV/LPVs in DDR genes tended to increase gradually with higher Gleason scores (GS 7, 7.1%; GS 8, 7.5%; GS 9–10, 9.9%).
Conclusions
BRCA2 was the most frequently mutated gene common to different cohorts supporting its importance, but differences in variant distribution in Korean mPC underscore the need for ethnic-specific genetic models. Future ethnic-specific analyses are warranted to verify our findings.
2.A Clinical Risk Score to Predict In-hospital Mortality from COVID-19 in South Korea
Ae-Young HER ; Youngjune BHAK ; Eun Jung JUN ; Song Lin YUAN ; Scot GARG ; Semin LEE ; Jong BHAK ; Eun-Seok SHIN
Journal of Korean Medical Science 2021;36(15):e108-
Background:
Early identification of patients with coronavirus disease 2019 (COVID-19) who are at high risk of mortality is of vital importance for appropriate clinical decision making and delivering optimal treatment. We aimed to develop and validate a clinical risk score for predicting mortality at the time of admission of patients hospitalized with COVID-19.
Methods:
Collaborating with the Korea Centers for Disease Control and Prevention (KCDC), we established a prospective consecutive cohort of 5,628 patients with confirmed COVID-19 infection who were admitted to 120 hospitals in Korea between January 20, 2020, and April 30, 2020. The cohort was randomly divided using a 7:3 ratio into a development (n = 3,940) and validation (n = 1,688) set. Clinical information and complete blood count (CBC) detected at admission were investigated using Least Absolute Shrinkage and Selection Operator (LASSO) and logistic regression to construct a predictive risk score (COVID-Mortality Score).The discriminative power of the risk model was assessed by calculating the area under the curve (AUC) of the receiver operating characteristic curves.
Results:
The incidence of mortality was 4.3% in both the development and validation set.A COVID-Mortality Score consisting of age, sex, body mass index, combined comorbidity, clinical symptoms, and CBC was developed. AUCs of the scoring system were 0.96 (95% confidence interval [CI], 0.85–0.91) and 0.97 (95% CI, 0.84–0.93) in the development and validation set, respectively. If the model was optimized for > 90% sensitivity, accuracies were 81.0% and 80.2% with sensitivities of 91.7% and 86.1% in the development and validation set, respectively. The optimized scoring system has been applied to the public online risk calculator (https://www.diseaseriskscore.com).
Conclusion
This clinically developed and validated COVID-Mortality Score, using clinical data available at the time of admission, will aid clinicians in predicting in-hospital mortality.
3.A Clinical Risk Score to Predict In-hospital Mortality from COVID-19 in South Korea
Ae-Young HER ; Youngjune BHAK ; Eun Jung JUN ; Song Lin YUAN ; Scot GARG ; Semin LEE ; Jong BHAK ; Eun-Seok SHIN
Journal of Korean Medical Science 2021;36(15):e108-
Background:
Early identification of patients with coronavirus disease 2019 (COVID-19) who are at high risk of mortality is of vital importance for appropriate clinical decision making and delivering optimal treatment. We aimed to develop and validate a clinical risk score for predicting mortality at the time of admission of patients hospitalized with COVID-19.
Methods:
Collaborating with the Korea Centers for Disease Control and Prevention (KCDC), we established a prospective consecutive cohort of 5,628 patients with confirmed COVID-19 infection who were admitted to 120 hospitals in Korea between January 20, 2020, and April 30, 2020. The cohort was randomly divided using a 7:3 ratio into a development (n = 3,940) and validation (n = 1,688) set. Clinical information and complete blood count (CBC) detected at admission were investigated using Least Absolute Shrinkage and Selection Operator (LASSO) and logistic regression to construct a predictive risk score (COVID-Mortality Score).The discriminative power of the risk model was assessed by calculating the area under the curve (AUC) of the receiver operating characteristic curves.
Results:
The incidence of mortality was 4.3% in both the development and validation set.A COVID-Mortality Score consisting of age, sex, body mass index, combined comorbidity, clinical symptoms, and CBC was developed. AUCs of the scoring system were 0.96 (95% confidence interval [CI], 0.85–0.91) and 0.97 (95% CI, 0.84–0.93) in the development and validation set, respectively. If the model was optimized for > 90% sensitivity, accuracies were 81.0% and 80.2% with sensitivities of 91.7% and 86.1% in the development and validation set, respectively. The optimized scoring system has been applied to the public online risk calculator (https://www.diseaseriskscore.com).
Conclusion
This clinically developed and validated COVID-Mortality Score, using clinical data available at the time of admission, will aid clinicians in predicting in-hospital mortality.
4.New Lung Cancer Panel for High-Throughput Targeted Resequencing.
Eun Hye KIM ; Sunghoon LEE ; Jongsun PARK ; Kyusang LEE ; Jong BHAK ; Byung Chul KIM
Genomics & Informatics 2014;12(2):50-57
We present a new next-generation sequencing-based method to identify somatic mutations of lung cancer. It is a comprehensive mutation profiling protocol to detect somatic mutations in 30 genes found frequently in lung adenocarcinoma. The total length of the target regions is 107 kb, and a capture assay was designed to cover 99% of it. This method exhibited about 97% mean coverage at 30x sequencing depth and 42% average specificity when sequencing of more than 3.25 Gb was carried out for the normal sample. We discovered 513 variations from targeted exome sequencing of lung cancer cells, which is 3.9-fold higher than in the normal sample. The variations in cancer cells included previously reported somatic mutations in the COSMIC database, such as variations in TP53, KRAS, and STK11 of sample H-23 and in EGFR of sample H-1650, especially with more than 1,000x coverage. Among the somatic mutations, up to 91% of single nucleotide polymorphisms from the two cancer samples were validated by DNA microarray-based genotyping. Our results demonstrated the feasibility of high-throughput mutation profiling with lung adenocarcinoma samples, and the profiling method can be used as a robust and effective protocol for somatic variant screening.
Adenocarcinoma
;
DNA
;
Exome
;
High-Throughput Nucleotide Sequencing
;
Lung
;
Lung Neoplasms*
;
Mass Screening
;
Polymorphism, Single Nucleotide
;
Sensitivity and Specificity
5.How Many SNPs Should Be Used for the Human Phylogeny of Highly Related Ethnicities? A Case of Pan Asian 63 Ethnicities.
Hoyoung GHANG ; Youngjoo HAN ; Sangjin JEONG ; Jong BHAK ; Sunghoon LEE ; Tae Hyung KIM ; Chulhong KIM ; Sangsoo KIM ; Fahd AL-MULLA ; Chan Hyun YOUN ; Hyang Sook YOO
Genomics & Informatics 2011;9(4):181-188
In planning a model-based phylogenic study for highly related ethnic data, the SNP marker number is an important factor to determine for relationship inferences. Genotype frequency data, utilizing a sub sampling method, from 63 Pan Asian ethnic groups was used for determining the minimum SNP number required to establish such relationships. Bootstrap random sub-samplings were done from 5.6K PASNPi SNP data. DA distance was calculated and neighbour-joining trees were drawn with every re-sampling data set. Consensus trees were made with the same 100 sub-samples and bootstrap proportions were calculated. The tree consistency to the one obtained from the whole marker set, improved with increasing marker numbers. The bootstrap proportions became reliable when more than 7,000 SNPs were used at a time. Within highly related ethnic groups, the minimum SNPs number for a robust neighbor-joining tree inference was about 7,000 for a 95% bootstrap support.
Asian Continental Ancestry Group
;
Consensus
;
Ethnic Groups
;
Genotype
;
Humans
;
Phylogeny
;
Polymorphism, Single Nucleotide
6.Post-GWAS Strategies.
Genomics & Informatics 2011;9(1):1-4
Genome-wide association (GWA) studies are the method of choice for discovering loci associated with common diseases. More than a thousand GWA studies have reported successful identification of statistically significant association signals in human genomes for a variety of complex diseases. In this review, I discuss some of the issues related to the future of GWA studies and their biomedical applications.
Genome, Human
;
Genome-Wide Association Study
;
Humans
7.Structural Bioinformatics Analysis of Disease-related Mutations.
Seong Jin PARK ; Sangho OH ; Daeui PARK ; Jong BHAK
Genomics & Informatics 2008;6(3):142-146
In order to understand the protein functions that are related to disease, it is important to detect the correlation between amino acid mutations and isease. Many mutation studies about disease-related proteins have been carried out through molecular biology techniques, such as vector design, protein engineering, and protein crystallization. However, experimental protein mutation studies are time-consuming, be it in vivo or in vitro. We therefore performed a bioinformatic analysis of known disease-related mutations and their protein structure changes in order to analyze the correlation between mutation and disease. For this study, we selected 111 diseases that were related to 175 proteins from the PDB database and 710 mutations that were found in the protein structures. The mutations were acquired from the Human Gene Mutation Database (HGMD). We selected point mutations, excluding only insertions or deletions, for detecting structural changes. To detect a structural change by mutation, we analyzed not only the structural properties (distance of pocket and mutation, pocket size, surface size, and stability), but also the physico-chemical properties (weight, instability, isoelectric point (IEP), and GRAVY score) for the 710 mutations. We detected that the distance between the pocket and disease-related mutation lay within 20 A (98.5%, 700 proteins). We found that there was no significant correlation between structural stability and disease-causing mutations or between hydrophobicity changes and critical mutations. For large-scale mutational analysis of disease-causing mutations, our bioinformatics approach, using 710 structural mutations, called "Structural Mutatomics," can help researchers to detect disease-specific mutations and to understand the biological functions of disease-related proteins.
Computational Biology
;
Crystallization
;
Humans
;
Hydrophobic and Hydrophilic Interactions
;
Isoelectric Point
;
Molecular Biology
;
Point Mutation
;
Protein Engineering
;
Proteins
8.Personal Genomics, Bioinformatics, and Variomics.
Jong BHAK ; Ho GHANG ; Rohit REJA ; Sangsoo KIM
Genomics & Informatics 2008;6(4):161-165
In 2008 at least five complete genome sequences are available. It is known that there are over 15,000,000 genetic variants, called SNPs, in the dbSNP database. The cost of full genome sequencing in 2009 is claimed to be less than $5000 USD. The genomics era has arrived in 2008. This review introduces technologies, bioinformatics, genomics visions, and variomics projects. Variomics is the study of the total genetic variation in an individual and populations. Research on genetic variation is the most valuable among many genomics research branches. Genomics and variomics projects will change biology and the society so dramatically that biology will become an everyday technology like personal computers and the internet. 'BioRevolution' is the term that can adequately describe this change.
Biology
;
Computational Biology
;
Genetic Variation
;
Genome
;
Genomics
;
Humans
;
Internet
;
Microcomputers
;
Polymorphism, Single Nucleotide
;
Vision, Ocular
9.Biological Object Downloader (BOD) Service for Easy Download and Management of Biological Databases.
Daeui PARK ; Jungwoo LEE ; Giseok YOON ; Sungsam GONG ; Jong BHAK
Genomics & Informatics 2007;5(4):196-199
BOD is an FTP service management tool on the Internet. It was developed for biological researchers in South Korea. It enables easier and faster access of bioinformation without having to go through foreign FTP sites. BOD includes an automatic downloader with a management and email alert service from which the user can easily select and schedule any biological database. Once listed in BOD, the user can check and modify the download status and data from an additional email alert service.
Appointments and Schedules
;
Electronic Mail
;
Internet
;
Korea
10.BioCC: An Openfree Hypertext Bio Community Cluster for Biology.
Sungsam GONG ; Tae Hyung KIM ; Jungsu OH ; Jekeun KWON ; Su An CHO ; Dan BOLSER ; Jong BHAK
Genomics & Informatics 2006;4(3):125-128
We present an openfree hypertext (also known as wiki) web cluster called BioCC. BioCC is a novel wiki farm that lets researchers create hundreds of biological web sites. The web sites form an organic information network. The contents of all the sites on the BioCC wiki farm are modifiable by anonymous as well as registered users. This enables biologists with diverse backgrounds to form their own Internet bio-communities. Each community can have custom-made layouts for information, discussion, and knowledge exchange. BioCC aims to form an ever-expanding network of openfree biological knowledge databases used and maintained by biological experts, students, and general users. The philosophy behind BioCC is that the formation of biological knowledge is best achieved by open-minded individuals freely exchanging information. In the near future, the amount of genomic information will have flooded society. BioGG can be an effective and quickly updated knowledge database system. BioCC uses an opensource wiki system called Mediawiki. However, for easier editing, a modified version of Mediawiki, called Biowiki, has been applied. Unlike Mediawiki, Biowiki uses a WYSIWYG (What You See Is What You Get) text editor. BioCC is under a share-alike license called BioLicense (http://biolicense.org). The BioCC top level site is found at http://bio.cc/
Anonyms and Pseudonyms
;
Biology*
;
Computational Biology
;
Humans
;
Hypermedia*
;
Information Services
;
Internet
;
Licensure
;
Linear Energy Transfer
;
Philosophy

Result Analysis
Print
Save
E-mail