Laboratory home page top

Return to CGED

 

Cancer Gene Expression Profiling

Summary of The Effort
Description of CGED (Cancer Gene Expression Database)
Flat Data Files for CGED
References
Credits

 

During the period 2000-2004, we constructed a comprehensive collection of gene expression and clinical data from a large number of patients with the major cancers. It is followed by a number of informatics studies for knowledge extraction. The expression data have been obtained by adaptor-tagged competitive PCR (ATAC-PCR), a high-throughput RT-PCR technique. RT-PCR has a wider dynamic range of detection, compared to hybridization-based techniques. In addition, tumor specimens are obtained from a limited number of hospitals, and uniform clinical backgrounds are expected. The main focus of the following researches was exploration of correlation between gene expression and clinical data for future diagnostic application.

 

Summary of The Effort

The main part of the project was carried out in Taisho Laboratory of Functional Genomics, Nara Institute of Science and Technology. The tumor samples were mainly supplied from Osaka University Medical School. Glioblastoma and lung cancer is from Kyoto University Medical School and from Osaka Medical Center for Cancer and Cardiovascular Diseases, respectively.
For gene expression profiling, we at first surveyed expressed genes in each cancer tissue by EST sequencing. Genes for the ATAC-PCR assay were selected by the order of the abundance. Genes selected by specialists were also added for the assay. The number of genes for the ATAC-PCR assay was ranged from 1536 to 3072. Gene expression data were obtained as relative expression levels against control samples. We usually used mixtures of tumor tissues from several patients to detect the slight differences among tumor tissues from different patients.
Currently, acquisition of data for breast, colorectal, hepatocellular, esophageal, thyroid,gastric and lung cancers, and glioblastoma has been finished.

Factual details of each cancer project

Breast cancer (molecular classification)
Specimen: 301 samples (Nov 1996 to Sep 2000, Osaka University Medical School Hospital). 98 samples were used for gene expression profiling. Others were for validation.
Number of assayed genes: 2412 genes
Reference: 3

Breast cancer (docetaxel resistance)
Specimen: 70 samples (Feb 1999 to June 2002, Osaka University Medical School Hospital) subjected for neoadjuvant therapy by docetaxel. All samples were obtained by biopsy before the therapy. 44 samples for the learning set are included in CGED.
Number of assayed genes: 2453 genes
Reference: 11

Colorectal cancer
Specimen: 100 samples (April 1994 to September 2000, Osaka University Medical School Hospital).
Number of assayed genes: 1536 genes
Reference: 4

Hepatocellular cancer
Specimen: tumor 120, non-tumor 86, normal 32 (January 1997 to April 2003, Osaka University Medical School Hospital)
Number of assayed genes: 3072 genes
References: 5, 7, 8, 9, 10

Gastric cancer
Specimen: 123 samples (Feburary 1994 to April 2003, Osaka University Medical School Hospital).
Number of assayed genes: 2304 genes
Reference: 4, 18

Thyroid cancer
Specinen: papillary carcinoma 42, follicular carcinoma 28, follicular adenoma 58, normal thyroid tissue 40 (Feb 1997 to Nov 2002, Osaka University Medical School Hospital, Kuma Hospital)
Number of assayed genes: 2516 genes
Reference: 15

Esophageal cancer (squamaous cell carcinoma)
Specimen: cancer tissues 160 (April 1997 to Feb 1999, Osaka Medical Center for Cancer and Cardiovascular Diseases)
Number of assayed genes: 1904 genes

Glioma
Specimen: 152 gliomas (100 glioblastomas, 21 anaplastic astrocytomas, 19 diffuse astrocytomas, and 12 anaplastic oligodendrogliomas) (Department of Neurosurgery, Kyoto University Graduate School of Medicine)
Number of assayed genes: 3456 genes
Reference: 12

CGED (Cancer Gene Expression Database)

CGED (http://lifesciencedb.jp/cged/) is an integrated database of gene expression data and clinical information. The current update (March, 2007) includes data of the CGIP first phase for breast, breast (docetaxel response), colorectal, hepatocellular, esophageal, gastric and thyroid cancers.

Flat Data Files for CGED

The data are logarithmically converted relative expression levels against controls, normalized by median of samples and then by genes. Upper and lower cut-off values are set at 20 and 0.05 folds of the control, respectively. This normalization procedure is chosen for better presentation in mosaic plots. Although the procedure is popular, we have been used data nomalized only by median of samples for statistical studies. Such data sets are found in http://genome.mc.pref.osaka.jp/data_download.html. Due to space limitation, all of the data are not deposited for hepatocellular and thyroid cancers. In esophageal cancer, samples without clinical data are not depoisted.

Breast cancer (molecular classfication)              [Zip]
Breast cancer (docetaxel response)              [Zip]
Colorectal cancer             [Zip]
Hepatocellular cancer             [Zip]
Esophageal cancer             [Zip]
Gastric cancer             [Zip]
Thyroid cancer             [Zip]

 

 

References

1. Kato, K. Adaptor-tagged competitive PCR: a novel method for measuring relative gene expression. Nucleic Acids Res. 25 (1997) 4694-4696.
2. Matoba, R., Kato, K., Kurooka, C., Maruyama, C., Sakakibara, Y. and Matsubara, K. Correlation between gene functions and developmental expression patterns in the mouse cerebellum. Eur. J. Neurosci. 12 (2000) 1357-1371.
3. Iwao, K., Matoba, R., Ueno, N., Ando, A., Miyoshi, Y., Matsubara, K., Noguchi, S. and Kato, K. Molecular classification of primary breast tumors possessing distinct prognostic properties. Hum Mol Genet, 11 (2002) 199-206.
4. Muro, S., Takemasa, I., Oba, S., Matoba, R., Ueno, N., Maruyama, M., Yamashita, R., Sekimoto, M., Yamamoto, H., Nakamori, S., Monden, M., Ishii, S., and Kato, K. Identification of expressed genes linked to the malignancy of human colorectal carcinoma by parametric clustering of quantitative expression data. Genome Biol. 4 (2003) R21.1-10.
5. Kurokawa, Y., Matoba, R., Takemasa, I., Nakamori, S., Tsujie, M., Nagano, H., Dono, K., Umeshita, K., Sakon, M., Ueno, N., Kita, H., Oba, S., Ishii, S., Kato, K. and Monden, M. Molecular features of non-B, non-C hepatocellular carcinoma: a PCR-array gene expression profiling study. J. Hepatol., 39 (2003) 1004-1012.
6. Kato, K., Yamashita, R., Takagi, T. and Nakai, K. Cancer Gene Expression Database (CGED). The 2004 Molecular Biology Database Collection. Nucleic Acids Res., 32 (2004) D3-D22.
7. Kurokawa, Y., Matoba, R., Nakamori, S., Nagano, H., Dono, K., Umeshita, K., Sakon, M., Monden, M. and Kato, K. PCR-array gene expression profiling of hepatocellular carcinoma. J Exp Clin. Cancer Res., 23 (2004) 135-141.
8. Kurokawa, Y., Matoba, R., Takemasa, I., Nagano, H., Dono, K., Nakamori, S., Umeshita, K., Sakon, M., Ueno, N., Oba, S., Ishii, S., Kato, K., and Monden, M. Molecular-based prediction of early recurrence in hepatocellular carcinoma. J. Hepatol. 41(2004) 284-291.
9. Sese, J., Kurokawa, Y., Monden, M., Kato, K. and Morishita, S. Constrained clusters of gene expression profiles with pathological features. Bioinformatics 20 (2004) 3137-3145.
10. Kurokawa, Y., Matoba, R., Nagano, H., Sakon, M., Takemasa, I., Nakamori, S., Dono, K., Umeshita, K., Ueno, N., Ishii, S., Kato, K. and Monden, M. Molecular prediction of response to 5-fluorouracil and interferon-alpha combination chemotherapy in adavanced hepatocellular carcinoma. Clin. Cancer Res., 10 (2004) 6029-6038.
11. Iwao-Koizumi, K., Matoba, R., Ueno, N., Kim, S.J., Ando, A., Miyoshi, Y., Maeda, E., Noguchi, S. and Kato, K. Prediction of docetaxel response in human breast cancer by gene expression profiling. J. Clin. Oncol., 23 (2005) 422-431.
12. Kato, K., Yamashita, R., Matoba, R., Monden, M., Noguchi, S., Takagi, T. and Nakai, K. Cancer Gene Expression Database (CGED): a database for gene expression profiling and accompanying clinical information of human cancer tissues, Nucleic Acids Res., 33 (2005) D533-D536.
13. Motoori, M., Takemasa, I., Yano, M., Saito, S., Miyata, H., Takiguchi, S., Fujiwara, Y., Yasuda, T., Kurokawa, Y., Ueno, N., Oba, S., Ishii, S., Monden, M. and Kato, K. Prediction of recurrence in advanced gastric cancer patients after curative resection by gene expression profiling. Int. J. Cancer, 114 (2005) 963-968.
14. Yukinawa, N., Oba, S., Kato, K., & Ishii, S. Multi-class pattern classification based on a probabilistic model of combining binary classifiers. International Conference on Artificial Neural Networks (ICANN 2005) . Lecture Notes in Computer Science, 3697 (2005) 337 342.
15. Taniguchi, K., Takano, T., Miyauchi, A., Koizumi, K., Ito, Y., Takamura, Y., Ishitobi, M., Miyoshi, Y., Taguchi, T., Tamaki, Y., Kato, K. and Noguchi, S.. Differentiation of follicular thyroid adenoma from carcinoma by gene expression profiling with adapter-tagged competitive polymerase chain reaction. Oncology, 69 (2005) 428-435.
16. Kim, S.J., Miyoshi, Y., Taguchi, T., Tamaki, Y., Nakamura, H., Yodoi, J., Kato, K. and Noguchi, S. High thioredoxin expression is associated with resistance to docetaxel in primary breast cancer. Clin. Cancer Res. 23 (2005) 8425-8430.
17. Kurokawa, Y., Honma, K., Takemasa, I., Nakamori, S., Kita-Matsuo, H., Motoori, M., Nagano, H., Dono, K., Ochiya, T., Monden, M. and Kato, K. Central genetic alterations common to all HCV-positive, HBV-positive and non-B, non-C hepatocellular carcionoma: A new approach to identify novel tumor markers. Int. J. Oncol. 28 (2006) 383-391.
18. Oba, S., Kato, K. and Ishii, S. Multi-scale clustering for gene expression data. Fifth International Symposium on Bioinformatics and Bioengineering. BIBE, p210-217.
19. Motoori, M., Takemasa, I., Doki, Y., Saito, S., Miyata, H., Takiguchi, S., Fujiwara, Y., Yasuda, T., Yano, M., Kurokawa, Y., Komori, T., Yamasaki, M., Ueno, N., Oba, S., Ishii, S., Monden, M. and Kato, K. Prediction of peritoneal metastasis in advanced gastric cancer by gene expression profiling of the primary site. Eur. J. Cancer 42(2006) 1897-1903.
20. Yukinawa, N., Oba, S., Kato, K., Taniguchi, K., Iwao-Koizumi, K., Tamaki, Y., Noguchi, S., Ishii, S. A multi-class predictor based on a probabilistic model: application to gene expression profiling-based diagnosis of thyroid tumors. BMC Genomics 7 (2006) 190.
21. Shirahata, M., Iwao-Koizumi, K., Saito, S., Ueno, N., Oda, M., Hashimoto, N., Takahashi, J.A. and Kato, K. A gene expression-based molecular diagnostic system for malignant gliomas displays clinical utility, prognostic ability and reproducibility superior to histological diagnosis. Clin. Cancer Res. 13 (2007) 7341-7356.
22. Yukinawa, N., Oba, S., Kato, K. and Ishii, S. Optimal aggregation of binary classifiers for multi-class cancer diagnosis using gene expression profiles. IEEE/ACM Transactions on Computational Biology and Bioinformatics, in press.