Big Data for Healthcare and Biomedical Research

Biomedical Informatics (BMI) is a multidisciplinary field that is motivated by our desire to improve diagnosis, clinical care, and human health, through novel computational approaches to use (and learn from) biomedical and clinical data. We use our expertise in computer science and informatics by developing various enabling tools, technologies, and algorithms to solve specific biomedical and clinical applications. And in doing so help advance our understanding of disease and treatment, and also develop useful software and applications. Members of the department work in a variety of areas that range from machine learning, healthcare middleware that levrages cloud computing, clinical information systems, clinically oriented image analysis and biomedical knowledge modeling. The driving applications for the various ongoing projects include cancer research, organ transplant, HIV, medical imaging, radiation therapy, and clinical data analytics. All development work that is undertaken is free and open-source. In previous years, GSOC students have worked on diverse projects such as: geospatial systems for exploring microscopy environments that leveraged Hadoop; GPU accelerated pipelines for computational analysis of digitized biopsies; interactive visualization platforms for viewing massive images (>1GB); systems for data agnostic sharing of biomedical research datasets; Apache Drill based data integration platform; CNN based high throughput analysis of digitized biopsies; and information visualization of heterogeneous medical data. Many of these projects have been published in reputable journals and presented at major conferences. Some of the projects proved to be so successful, that they were adopted in major national/international biomedical research initiatives.

lightbulb_outline View ideas list


  • java
  • medical imaging
  • deep learning
  • hadoop
  • python


  • Science and Medicine
  • cloud
  • web platform and services
  • precision medicine
  • big data visualization
  • data fusion
email Mailing list
mail_outline Contact email

Biomedical Informatics, Emory University 2016 Projects

  • coco90417
    Building Survival Models from Genomic Data with Google TensorFlow
    Large-scale cancer genomic research, such as TCGA project generated huge amount of high-dimensional data, yet most features are under-explored for...
  • Peter Vu
    De-Mystifying Medical Imaging DICOM Data
    A lot of Medical Imaging data used and shared today is formatted using the DICOM standard. It can be collected and rendered into a Descriptive Data...
  • Stefan Florea
    Information Visualizations using Datascope
    DataScope is an interactive dashboard system for doing exploratory analysis on large biomedical datasets. Currently the dashboard is splitted into...
  • Nelson
    Multi-resolution Analysis of Pathology Data with Convolutional Neural Networks (CNNs)
    The ultimate goal of this project is exploring high and low-resolution images techniques for performing high-level inferences over complete tissue...
  • Irene
    Near Duplicate Detection in Medical Image Archives Proposal
    I plan to complete the project with method of classification and indexing technology on Spark as well as Hazelcast if necessary. Hope for more...