Home
Hasso-Plattner-Institut
Hasso-Plattner-Institut
Prof. Dr. h.c. mult. Hasso Plattner
schließen
schließen

Our team is giving a series of lectures and seminars with a focus on enterprise systems design and in-memory data management. Strong links to the industry ensure a close connection between theory and its implementation in the real world.

schließen
schließen
schließen

If you are having questions regarding one of our publications, please contact the authors.

schließen
  1. HOME
  2. > Projects
  3. > Project Archive
  4. > OKOA

Identifying Discriminant Cancer Genes

News: OKOA app is publicly available now! Try out yourself and explore public cancer data at http://okoa.epic-hpi.de/

Cancer is a Heterogeneous Disease

Cancer is the second leading cause of death worldwide. Yet, we still have little knowledge on it – one reason is that there is no single cancer disease, but the word "cancer" is rather used to refer to any of the 200 diseases that are characterized by an uncontrolled growth of cells, invading and damaging the body’s normal tissues. As no cancer is like the other, researchers strike to identify the main actors of the uncontrolled cell growth. Those main actors are typically genes that are abnormally (higher or lower) expressed in cancer cells and thus negatively affect the cell processes. Researchers are especially interested in changed behavior of gene expressions, as this tells them a lot about the molecular processes and relationships in cancerous cells, e.g gene functions and interactions.

RNA-Sequencing (RNA-Seq) delivers a complete snapshot of gene expression in a cell, with a single experiment containing expression levels of tens of thousands of genes from multiple hundred samples. The nature of gene expression data, however, poses challenges to its analysis in terms of its high dimensionality, noise, and complexity of the underlying biological processes to detect. Researchers aim to identify genes that reliably discriminate sample groups from each other, e.g. a cancerous from a healthy one. The current state-of-the-art for gene selection is to apply traditional statistical and machine learning approaches, e.g. Support Vector Machines (SVM).

Identifying Discriminant Cancer Genes

The aim of the project is to design and implement a technique that identifies markers for given clusters of cancer types. We use state-of-the-art and extended machine learning techniques to analyze genetic cancer data, e.g. gene expression profiles, that have been grouped prior into cancer types. For each cluster, i.e. cancer type, we aim to identify the cluster-discriminant features, e.g. those genes whose expression pattern is unique for the respective cluster. Moreover, we enable the user to straightforward specify algorithms and parameters in our web application. The results can be explored in an interactive diagram and be assessed with supplementary fitness scores. Additionally, the found genes are automatically evaluated regarding their cancer-relevance based on external biological knowledge bases and can be further examined with regards to their function in the cell and their role in causing cancer.

The project resulted in an interactive explorational web application named OKOA (hawaii.: different). We have imported public gene expression data from The Cancer Genome Atlas (TCGA) and the Genotype-Tissue Expression (GTEx) programm and made it ready to be analyzed in our app. Try out yourself and visit http://okoa.epic-hpi.de/!

Contact: Cindy Perscheid, Milena Kraus, Matthias Uflacker

News

22.09.2023 | Trends and Concepts in the Softwareindustry Seminar offered in WiSe 2023/2024

Trends and Concepts in the Softwareindustry Seminar offered in WiSe 2023/2024 > Zum Artikel

22.05.2023 | Christopher Hagedorn Successfully Defended His PhD Thesis

Christopher Hagedorn Successfully Defended His PhD Thesis > Zum Artikel

03.03.2023 | Last Trends and Concepts course of Prof. Hasso Plattner

After more than 20 years of teaching, our founder and benefactor Prof. Hasso Plattner visited the HPI this week for his … > Zum Artikel

01.03.2023 | Jan Kossmann Successfully Defended His PhD Thesis

Last week, Jan Kossmann another PhD student of our EPIC group successfully defended his thesis on the topic of … > Zum Artikel

26.02.2023 | Paper on Data Tiering in Hyrise Published in BTW Proceedings

Our latest paper on data tiering in Hyrise "Workload-Driven Data Placement for Tierless In-Memory Database Systems" by … > Zum Artikel

24.02.2023 | Paper on EPIC Research Group Published in SIGMOD Record

Our report "Enterprise Platform and Integration Concepts Research at HPI" has been published in the December issue of … > Zum Artikel

30.11.2022 | Paper on Database Optimizations for Spatio-Temporal Data published in PVLDB

Our paper "Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems" has been published in … > Zum Artikel

04.10.2022 | Günter Hesse Successfully Defended His PhD Thesis

Last week, Günter Hesse another PhD student of our EPIC group successfully defended his thesis on the topic of "A … > Zum Artikel

08.07.2022 | Successful PhD Defense by Markus Dreseler

Markus Dreseler has successfully defended his PhD thesis on Automatic Tiering for In-Memory Database Systems. > Zum Artikel

Literature

"A Course in In-Memory Data Management" by Prof. Dr. h.c. Hasso Plattner. This book is the culmination of six years work of in-memory research. As such, it provides the technical foundation for combined transactional and analytical workloads inside one single database as well as examples of new applications that are now possible given the availability of the new technology. The book is available at Springer.

Contact

Dr. Michael Perscheid

Chair Representative

Tel.: +49 (331) 5509-566

E-Mail:


Office:

Room: V-2.12

Tel.: +49 (331) 5509-560

Fax: +49 (331) 5509-579

E-Mail:

Follow us on Twitter

Contact Details

AltStyle によって変換されたページ (->オリジナル) /