Events

Apr 7 - Apr 13, 2019

From Bioinformatics to Translational and Health Informatics: Whitening the Black Box

Prof. Yasser EL-Manzalawy, Information Sciences and Technology, Pennsylvania State University

Apr 8, 12:00 - 13:00

B9 H1

Pennsylvania State University machine learning

Machine learning has been extensively used in developing predictive models for a variety of bioinformatics tools. In many of these applications, to achieve the highest possible predictive performance, black box models (e.g., SVMs and Neural Networks) had been preferred over white box models (e.g., Decision Trees and Rule-based models). First, I argue that sacrificing interpretability for the sake of performance is a reasonable decision for many bioinformatics applications. However, I will demonstrate that uncareful utilization of black box models can lead to misinterpretations of the results.

Mar 31 - Apr 6, 2019

Mar 24 - Mar 30, 2019

Mar 17 - Mar 23, 2019

The Evolution of Machine Learning

Professor George Karypis, University of Minnesota

Mar 18, 12:00 - 13:00

B4 B5 A0215

University of Minnesota machine learning

This talk presents an overview of recent methodological advances in developing item-based nearest-neighbor-based recommender systems that have substantially improved their performance. Biography George Karypis is a Distinguished McKnight University Professor and an ADC Chair of Digital Technology at the Department of Computer Science & Engineering at the University of Minnesota, Twin Cities. His research interests span the areas of data mining, high performance computing, information retrieval, collaborative filtering, bioinformatics, cheminformatics, and scientific computing. His research has

Mar 3 - Mar 9, 2019

Promoters identification and analysis

Ramzan Umarov, Ph.D. Student, Computer Science

Mar 7, 16:00 - 18:00

B1 L2 R2202

promoters protein coding RNA genes TSS prediction tools

Promoter is a key region that is involved in differential transcription regulation of protein-coding and RNA genes. The gene-specific architecture of promoter sequences makes it extremely difficult to devise the general strategy for their computational identification.

Feb 24 - Mar 2, 2019

Neuro-symbolic systems in the life sciences

Robert Hoehndorf, Associate Professor, Computer Science

Feb 25, 12:00 - 13:00

B9 H1 R2322

biomedical data data analysis symbolic methods

Abstract The life sciences have invested significant resources in the development and application of semantic technologies to make research data accessible and interlinked, and to enable the integration and analysis of data. Utilizing the semantics associated with research data in data analysis approaches is often challenging. Now, novel methods are becoming available that combine symbolic methods and statistical methods in Artificial Intelligence. In my talk, I will describe how to generate knowledge graph embeddings for analysis of biological and biomedical data. Brief Biography Robert

Feb 3 - Feb 9, 2019

Nov 25 - Dec 1, 2018

AI4GH Seminar Series - Computational Modeling of Malaria Metabolism Reveals Different Stages and Species Nutrient Preferences and Drug Targets

Alyaa M Mohamed, Ph.D., Bioscience

Nov 25, 12:00 - 13:00

B2 R5220

genome plasmodial infections

Malaria kills nearly one-half million people a year and over 1 billion people are at risk of becoming infected by the parasite. Plasmodial infections are difficult to treat for a myriad of reasons, but the ability of the organism to remain latent in hosts and the complex life cycles greatly contributed to the difficulty in treat malaria.

Nov 18 - Nov 24, 2018

Nov 11 - Nov 17, 2018

AI4GH Seminar Series - Towards Rational Design of Biosynthesis Pathways

Meshari Alazmi, Ph.D., Computer Science

Nov 11, 12:00 - 13:00

B2 R5220

machine learning bioinformatics structural biology systems biology

Recent advances in genome editing and metabolic engineering enabled a precise construction of de novo biosynthesis pathways for high-value natural products. One important design decision to make for the engineering of heterologous biosynthesis systems is concerned with which foreign metabolic genes to introduce into a given host organism.

Nov 4 - Nov 10, 2018

Oct 28 - Nov 3, 2018

Oct 21 - Oct 27, 2018

Oct 14 - Oct 20, 2018

AI4GH Seminar Series - Variant Prioritization in Cancer: Understanding and Prediction of Cancer Driver Genes and Mutations

Oct 17, 12:00 - 13:00

B2 R5220

bioinformatics text mining machine learning cancer Ontologies

Sequencing has identified millions of somatic mutations in human cancers. Identifying and distinguishing cancer driver genes amongst the millions of candidate mutations remains a major challenge.

Oct 7 - Oct 13, 2018

AI4GH Seminar Series | Predicting Protein Function and Phenotype Associations

Maxat Kulmanov, Research Scientist, Bio-Ontology Research Group

Oct 10, 12:00 - 13:00

B2 R5220

biology biomedicine artificial intelligence knowledge discovery data integration protein function prediction

The amount of available protein sequences is rapidly increasing, mainly as a consequence of the development and application of high throughput sequencing technologies in the life sciences.

Sep 30 - Oct 6, 2018

AI4GH Seminar Series| Artificial Intelligence for Genomics and Health: Introduction and Overview

Robert Hoehndorf, Associate Professor, Computer Science

Oct 3, 12:00 - 13:00

B2 R5220

biology biomedicine bioinformatics Ontologies

Abstract Genomics and health are prime areas for the development and application of novel intelligent algorithms due to a large amount of available research data, the high complexity of the problem, and the significant social and economic benefits of improving health outcomes in humans. In the past, genomics and health have been one of the key drivers in the development of Artificial Intelligence methods, including machine learning, expert systems, or graph-based algorithms. I will introduce the new seminar series on Artificial Intelligence for Genomics and Health (AI4GH), and then discuss our

Sep 16 - Sep 22, 2018

CS Graduate Seminar by visitor Professor Vadim Lozin

Vadim Lozin, Professor, University of Warwick, UK

Sep 17, 14:00 - 15:00

KAUST

Abstract In this talk, we introduce a new graph parameter under the name functionality. We show that functionality generalizes simultaneously several other graph parameters, such as degeneracy or clique-width, by proving that bounded degeneracy or bounded clique-width imply bounded functionality. Moreover, we show that this generalization is proper by revealing classes of graphs of unbounded degeneracy and clique-width, where functionality is bounded by a constant. This includes permutation graphs and unit interval graphs. We also observe that bounded functionality implies bounded VC-dimension

Jul 29 - Aug 4, 2018

Similarity Algorithms by Anas Ismail

Aug 2, 09:00 - 11:00

B3 R5209

Abstract Here we provide two similarity algorithms each for a specific type of data. First, we provide a new algorithm to calculate the Gromov hyperbolicity constant which is a measure of how similar a metric is to a tree metric. We also provide a new algorithm determining how similar two spatial trajectories are.

Jul 22 - Jul 28, 2018

AI in Cancer Precision Medicine Workshop

Dr. Paul Schofield

Jul 25, 08:00 - Jul 26, 13:00

B3 R5220

We will investigate how novel AI technologies, including progress in machine learning, knowledge representation and reasoning can be applied to improving diagnosis and treatment of cancer in the era of genomic medicine.

Apr 15 - Apr 21, 2018

Neural Inductive Matrix Factorization for Predicting Disease-Gene Associations

Siqing Hou, M.S., Computer Science

Apr 18, 10:00 - 11:30

B3 R5208

bioinformatics machine learning Disease-Gene Associations

In silico prioritization of undiscovered associations can help find causal genes of newly discovered diseases. Some existing methods are based on known associations and side information of diseases and genes. We exploit the possibility of using a neural network model, Neural Inductive Matrix Completion (NIMC) in disease-gene prediction.

Apr 8 - Apr 14, 2018

Ontology Design Patterns for Combining Pathology and Anatomy: Application to Study Ageing and Longevity in Inbred Mouse Strains

Sarah Alghamdi, Ph.D. Student, Computer Science

Apr 10, 13:00 - 14:30

B9 R3120

biomedicine Ontologies data analysis semantic analysis computation techniques

Abstract In biomedical research, ontologies are widely used to represent knowledge as well as annotate datasets. Many of the existing ontologies cover a single type of phenomena, such as a process, cell type, gene, pathological entity or anatomical structure. Consequently, it is required to use multiple ontologies to fully characterize the observations in the datasets. Although this allows precise annotation of different aspects of a given dataset, it limits our ability to use the ontologies in data analysis, as the ontologies are usually disconnected and their combination cannot be exploited

Mar 25 - Mar 31, 2018

Big Data in Biodiversity and Health

Mar 26, 09:30 - Mar 28, 13:30

B3 L5 5209

About We are witnessing today an enormous increase in the volume and complexity of data across a variety of domains, including bioscience. Extracting useful information from such data is challenging. Although many approaches have already been developed, efficient analysis of big data in bioscience domain is far from satisfactory. Biodiversity and health are prominently characterized by a high volume of data with great complexity of information contained, which lead to various approaches to data analyses. The goal of this workshop is to present a selection of efforts currently being made at

Mar 18 - Mar 24, 2018

Computational and Statistical Interface to Big Data

Xin Gao, Professor, Computer Science

Mar 19, 08:00 - Mar 21, 17:00

B9 L2 H2

We are now in the fourth paradigm of science: Data Science. The massive amount of structured and unstructured data has posed new challenges and opportunities to the fields of computer science and statistics. Traditional computational and statistical methods for data storage, curation, sharing, querying, updating, visualization, analysis, and privacy have been shown to fail in the big data scenario due to the unprecedented volume, velocity, variety, veracity and value of the big data. This conference will bring together a number of prominent researchers in Computer Science and Statistics with common interests and active research in big data, as well as the researchers at KAUST who regularly generate or face big data, such as those in bioscience and red sea research.

Mar 11 - Mar 17, 2018

Feb 25 - Mar 3, 2018

Symbolic AI in Computational Biology: Applications to Disease Gene and Drug Target Identification

Robert Hoehndorf, Associate Professor, Computer Science

Feb 26, 16:30 - 17:30

The University of Cambridge in the United Kingdom

Abstract KAUST Assistant Professor Robert Hoehndorf will give a seminar on " Symbolic AI in Computational Biology: Applications to Disease Gene and Drug Target Identification" at the University of Cambridge in the United Kingdom. More Information The life sciences have invested significant resources in the development and application of semantic technologies to make research data accessible and interlinked, and to enable the integration and analysis of data. Utilizing the semantics associated with research data in data analysis approaches is often challenging. Now, novel methods are

Feb 18 - Feb 24, 2018

Keynote Speaker | The 8th BEAR PGR Conference & Users Forum 2018

Robert Hoehndorf, Associate Professor, Computer Science

Feb 23, 09:00 - 16:30

The University of Birmingham in the United Kingdom

High Performance Computing cloud storage data visualisation

KAUST Assistant Professor Robert Hoehndorf will be a keynote speaker at the 8th BEAR PGR Conference & Users Forum at the University of Birmingham in the United Kingdom.
Fifth KAUST-NVIDIA Workshop on Accelerating Scientific Applications Using GPUs

Timothy Lanfear , Brent Leback

Feb 18, 08:00 - Feb 20, 17:00

B4 B5 A0215

supercomputing

The KAUST Supercomputing Laboratory is co-organizing with NVIDIA, a leader in accelerated computing and artificial intelligence, a full-day workshop on accelerating scientific applications using GPUs on Tuesday, February 20th, 2018 in the auditorium between buildings 4 and 5.

Feb 4 - Feb 10, 2018

KAUST Research Workshop on Optimization and Big Data

Peter Richtarik, Professor, Computer Science

Feb 5, 08:00 - Feb 7, 05:00

B19 L3 H2

optimization machine learning Social Network Analysis asynchronous algorithms

The age of "big data" is here: data of unprecedented sizes is becoming ubiquitous, which brings new challenges and new opportunities. With this comes the need to solve optimization problems of unprecedented sizes.

Dec 10 - Dec 16, 2017

Novel Computational Methods to Predict Drug–target Interactions Using Graph Mining and Machine Learning Approaches

Rawan Olayan, Ph.D., Computer Science

Dec 11, 10:00 - 12:00

B3 L5 R5220

bioinformatics data integration data mining graph mining machine learning

Abstract Computational drug repurposing aims at finding new medical uses for existing drugs. The identification of novel drug-target interactions (DTIs) can be a useful part of such a task. Finding computationally DTIs is a convenient strategy to identify potentially new DTIs at low cost with reasonable accuracy. However, the current DTI prediction methods suffer a high false positive prediction rate. Here, we present a comprehensive review of the recent progress in the field of DTI prediction from data-centric and algorithmic-centric perspectives that can help in constructing novel reliable

Dec 3 - Dec 9, 2017

Big Data Analyses in Evolutionary Biology

Dec 4, 08:00 - Dec 6, 17:00

B9 H2

big data Big data analysis evolutionary biology

This event is organized by CBRC with financial support from the KAUST Office of Sponsored Research

Nov 5 - Nov 11, 2017

Contributions to In Silico Genome Annotation

Manal Kalkatawi, Ph.D., Computer Science

Nov 9, 10:00 - 13:00

B3 L5 R5209

bioinformatics data mining machine learning Deep learning genomics

Abstract Genome annotation is an important topic since it provides information for the foundation of downstream genomic and biological research. It is considered as a way of summarizing part of existing knowledge about the genomic characteristics of an organism. Annotating different regions of a genome sequence is known as structural annotation while identifying functions of these regions are considered as a functional annotation. In silico approaches can facilitate both tasks that otherwise would be difficult and time-consuming. This study contributes to genome annotation by introducing

May 21 - May 27, 2017

PCCFD - Predictive Complex Computational Fluid Dynamics

David Keyes, Professor, Applied Mathematics and Computational Science

May 22, 08:45 - May 24, 05:00

B9 L2 H1

CFD algorithms applied mathematics numerical analysis Computer science

The PCCFD workshop will focus on cutting-edge research in the field of algorithmic development for CFD and multi-scale complex flow simulations.

May 14 - May 20, 2017

Mining Genome-Scale Growth Phenotype Data through Constant-Column Biclustering

Majed Alzahrani, Ph.D., Computer Science

May 17, 15:00 - 17:00

B3 L5 R5209

data mining machine learning Computational biology

Growth phenotype profiling of genome-wide gene-deletion strains overstresses conditions can offer a clear picture that the essentiality of genes depends on environmental conditions. In this dissertation, we first demonstrate that detecting such "co-fit" gene groups can be cast as a less well-studied problem in biclustering, i.e., constant-column biclustering. Despite significant advances in biclustering techniques, very few were designed for mining in growth phenotype data.

Apr 16 - Apr 22, 2017

Breaking the Boundaries: from Structure to Algorithms

Vadim Lozin, Professor, University of Warwick, UK

Apr 17, 14:00 - 15:00

KAUST

maximum independent set line graphs boundary classes of graphs

Abstract Finding a maximum independent set in a graph is an NP-hard problem. However, restricted to the class of line graphs this problem becomes polynomial-time solvable due to the celebrated matching algorithm of Jack Edmonds. What makes the problem easy in the class of line graphs and what other restrictions can lead to an efficient solution? To answer these questions, we employ the notion of boundary classes of graphs. In this talk, we shed some light on the structure of the boundary separating difficult instances of the problem from polynomially solvable ones and analyze algorithmic tools

Apr 9 - Apr 15, 2017

Nov 13 - Nov 19, 2016

Novel Data Mining Methods for Virtual Screening of Biological Active Chemical Compounds by Othman Soufan

Othman Soufan, Ph.D., Computer Science

Nov 16, 14:00 - 15:00

H2 B9

machine learning data mining Computational biology biomedical applications Chemical compounds visualization

Abstract Drug discovery is a process that takes many years and hundreds of millions of dollars to reveal a con dent conclusion about a specific treatment. Part of this sophisticated process is based on preliminary investigations to suggest a set of chemical compounds as candidate drugs for the treatment. Computational resources have been playing a significant role in this part through a step known as virtual screening. From a data mining perspective, the availability of rich data resources is key in training prediction models. Yet, the difficulties imposed by big expansion in data and its