HOME    »    PROGRAMS/ACTIVITIES    »    Annual Thematic Program
IMA Annual Program Year Workshop
Large Data Sets in Medical Informatics
November 14-18, 2011

   Organizers
Nevenka DimitrovaPhilips Research Laboratory
W. Clem KarlBoston University
Jean-Christophe Olivo-MarinInstitut Pasteur
Ahmed TewfikUniversity of Texas, Austin
  Description
images/2011-2012/W11.14-18.11/group.jpg
Medical informatics is currently limited by two critical challenges: the need to process large data sets to make inferences and the small size of replicates that limits the confidence in these inferences. For example, it is well known that modern CT and MRI images involve acquiring and processing large amounts of data. Less appreciated are the facts that these large amounts of data require hours of post-moaning to guide surgical interventions and that the time requirements of these methodologies limit their practical applications. To provide surgeons with real-time three-dimensional tracking of organ deformation—required in the newest proposed minimally invasive surgical procedures, such as NOTES—will require several orders of magnitude speed-ups in data processing and fusion across imaging modalities. Similarly, neurological investigations are based on large data sets collected from arrays of electrodes implanted on or inside the brain, spinal cord, or severed limbs. Inferences result from the careful analysis of quantitative and qualitative patterns in these signals and their coherent or incoherent evolutions. The problem is further compounded by the nonstationary nature of the behavior-specific signals of interest and the presence of other interfering signals. In addition, learning is severely limited by the small size and lack of richness of the training data; researchers can typically collect data only from a small number of animals or human subjects. Thus, training data may not cover the spectrum of signal characteristics in a general population which limits the ability to construct accurate statistical models. Variations in surgical implantations further limit the quality of the data. Finally, genomics is based on the analysis of large data sets corresponding to DNA copy number variations or gene expression levels. Experimental imperfections and the limited number of replicates have again hampered the confidence in the results of analyses. Indeed, most studies in the field cannot be reproduced. Recent studies also suggest that diseases are likely linked to large numbers of rare genetic variants that are seldom captured by most databases collected to date. Furthermore, the need to do in vivo nanoscale imaging of specific cells has led to a slew of complex challenges which includes the traditional challenges in both genomics and medical imaging. This workshop will bring together mathematicians, statisticians, engineers, and scientists working on particular aspects of medical informatics or related areas. A careful look at the literature in any of the subfields of medical informatics reveals specialized approaches and philosophies combined with a lack of knowledge of other potentially useful methodologies developed in other subfields of medical informatics. Furthermore, results and methodologies discovered in pertinent subfields of mathematics and statistics remain largely unknown in medical informatics. Conversely, researchers in real analysis, differential equations, algebraic geometry, and statistics are often unaware of the characteristics of the challenges in medical informatics that limit the applicability of generic approaches.
  Schedule
  Participants

Connect With Us:
Go