Thursday, September 7 |
07:00 - 08:45 |
Breakfast ↓ Breakfast is served daily between 7 and 9am in the Vistas Dining Room, the top floor of the Sally Borden Building. (Vistas Dining Room) |
09:00 - 09:30 |
Emily King: A potpourri of multi-scale mathematical techniques ↓ In this talk, three different mathematical techniques that may be considered "multi-scale" will be surveyed with some examples of applications. One technique comes from harmonic analysis: the use of multiscale a.k.a. multiresolution analysis in transforms like wavelet and shearlet, as well as their use within morphological component analysis, which was originally developed to separate different types of astrophysical data. Another technique comes from topological data analysis, namely persistence homology, which tracks features like holes and texture. An application to classification of cloud types will be given. Finally, a technique to find subspaces of increasing dimension to best represent a data set, called the flag median, will be presented with applications from computer vision. (TCPL 201) |
09:30 - 09:50 |
Gary Choi: Quantifying shape variation using quasi-conformal geometry ↓ Quasi-conformal geometry has recently emerged as a useful tool in imaging science. In this talk, we will discuss how quasi-conformal theory can be applied for quantifying biological shape variation. More specifically, quasi-conformal mappings can be used for establishing a 1-1 correspondence between two biological shapes with prescribed feature landmarks exactly matched. Moreover, the quasi-conformal distortion encodes important information about the local geometric difference between two shapes. Examples across biological scales are presented to demonstrate the effectiveness of the method. (Online) |
10:00 - 10:30 |
Coffee Break (TCPL Foyer) |
10:30 - 11:00 |
Robert Ravier: Hypothesis Testing on Patch Spaces via Manifold Moving Least Squares with Application to Evolutionary Anthropology ↓ Methods for performing hypothesis testing on anatomical surfaces, such as multivariate T-tests on corresponding vertices, tend to produce non-meaningful results when the collection of surfaces vary over multiple species, as is often the case in evolutionary anthropology; it is difficult to determine which areas on which surfaces differ in a statistically significant sense. Furthermore, anthropological settings often suffer from low sample sizes, calling the reliability of parametric methods into question. Motivated by these problems, we propose to a novel nonparametric statistical test based on apply manifold moving least-squares (MMLS), a technique for computing approximating manifolds for high-dimensional point clouds, to spaces of corresponding patches on a given collection of surfaces of interest. Specifically, we use the Riemannian metric of the learned manifold to define a notion of distance between any two patches, for which we have theoretical guarantees of approximation accuracy under mild assumptions, and propose a test based on bootstrapped distributions of distances on the manifold. As an application, we use this test in combination with standard statistical rank tests to quantitatively compare a molar found in the Nakwai region of Kenya with those of four different primate genera, the results of which may shed light on evolutionary timelines of primates. If time permits, we will briefly go over the detail of SAMS, the manifold learning-based registration process used in the course of this study. Joint work with Doug Boyer, Ingrid Daubechies, and Barak Sober. (TCPL 201) |
11:00 - 11:20 |
Shira Faigenbaum-Golovin: Fine registration of a collection of surfaces by studying the geometry of the base manifold ↓ Given a collection of surfaces with common key features, that have a rough correspondence, commonly the shape variation in the geometric configuration is considered. However, collating the geometric information of the triangular meshes is sometimes challenging due to the subtle variation of these shapes. A common practice is to model the collection of low-dimensional manifolds as a (nonlinear) fibre bundle with a connection. Even though the connection is known only approximately, its presence can be used to study the base manifold geometry using fibre bundle diffusion. In this talk, we propose a new method for efficiently registering such collections given initial correspondences with varying degrees of accuracy. This method leads to an improvement of the correspondence maps, which can then be exploited to study the base manifold geometry with greater accuracy. We demonstrate our methodology on a toy model, as well as on a collection of anatomical surfaces, and aim to shed light on questions in evolutionary anthropology. (Online) |
11:30 - 13:00 |
Lunch ↓ Lunch is served daily between 11:30am and 1:30pm in the Vistas Dining Room, the top floor of the Sally Borden Building. (Vistas Dining Room) |
13:00 - 13:30 |
Kingshuk Ghosh: Modeling Conformations of the Intrinsically Disordered Proteins: Integrating Theoretical Physics and Data Science ↓ Protein sequence — encoding the unique folded structure and consequently function — plays a profound role in biological information processing.
This central dogma, however, appears to be challenged by Intrinsically Disordered Proteins (IDP) that lack unique folded structure, requiring an ensemble-centric view point. Despite the high degree of disorder, IDPs have specific ensemble average conformational features and function that depend on the sequence, not just the composition of the amino acids. How do we unravel the hidden code relating sequence, conformational ensemble and function ? Borrowing principles and tools of theoretical polymer physics we are discovering elegant mathematical relations as a function of sequence (respecting exact placement of the amnio acids) giving rise to high ·dimensional matrices that serve as molecular blueprints of IDPs. These information rich matrices have dual role. First, they map sequence features to a map of three dimensional conformational features and provide insights on how to alter conformations (e.g collapse or expansion) using biological regulators (mutation, phosphorylation).
Second, these matrices combined with data science tools hold promise to detect functionally similar IDPs, an unmet challenge in IDPs due to failure of traditional tools of sequence/structure alignment developed for folded proteins. (TCPL 201) |
13:30 - 14:00 |
Assaf Amitai: Investigating a Universal Flu Vaccine's Development and SARS-CoV-2 Evolution via Viral Spike Geometry ↓ The evolution of circulating viruses is shaped by their need to evade antibody response, which mainly targets the glycoprotein (spike). However, this diversity explores the antigenic space unequally, allowing pathogens such as the influenza virus to impose complex immunodominance hierarchies that distract antibody responses away from crucial sites of virus vulnerability. We developed a computational model of affinity maturation to map the patterns of immunodominance that evolve upon immunization with natural and engineered displays (nano-particles) of hemagglutinin, the influenza vaccine antigen. In this talk, I will show how antibody responses can be focused upon functionally conserved, but immunologically recessive sites on the influenza spike that are the target of human broadly neutralizing antibodies -- a step toward a universal flu vaccine.
I will further show that geometry plays an integral part in shaping the evolution of the seasonal flu H1N1 and coronavirus spikes. Taking advantage of 3D models of the virus, we find that antibody pressure, through the geometrical organization of spikes on the viral surface, governs their spike mutability. Studying the mutability patterns of SARS-CoV-2, we find that over time, it acquired, at low frequency, several mutations at antibody-accessible positions, which could indicate possible escape as defined by our model. Hence, we offer a geometry-based approach to assess whether a pandemic virus is changing its mutational pattern to that indicative of a circulating virus. (TCPL 201) |
14:00 - 14:15 |
Geoffrey Woollard: Source distribution estimation in cryo-EM via amortized variational inference ↓ Inferring biomolecular heterogeneity with cryogenic sample electron microscopy (cryo-EM) is an ambiguous problem that benefits from distinctions and formalization. Depending on the way of representing the latent space, the underlying source distributions can become entangled in the observed distribution through the image formation model. Here I examine source distribution estimation in a minimal toy synthetic setting involving simple forms of heterogeneity, projection, and measurement noise. I show source distribution estimation in amortized variational inference with an evidence lower bound (ELBO) objective. I jointly infer parameters for an amortized posterior distribution of pose and heterogeneity on each measurement, and simultaneously update a running estimate of the source distribution. Although during the measurement process we may lose information about heterogeneity, we can still uniquely identify the underlying source distribution that best explains the data by making a commitment to a parametric forward model, and a distribution family for the latent heterogeneity. (TCPL 201) |
14:20 - 14:40 |
Aryan Tajmir Riahi (TCPL 201) |
15:00 - 15:30 |
Coffee Break (TCPL Foyer) |
15:00 - 15:15 |
Lilianna Houston: Building a classification tool for cell morphology simulations ↓ Morphologies of complex spatial structures like the cellular cytoskeleton are the product of a network of molecular interactions between proteins and other biomolecules. Learning about these molecular interactions by studying the evolution of the morphology with time is a difficult inverse problem. Simulations provide a controlled way of learning the role of different model parameters by selectively tuning their relative strengths and observing the resulting behavior. I used the Cytosim simulation package to explore the effect of motor proteins and cross linkers on cytoskeleton dynamics. While this forward approach of running simulation to generate data provided insights into changes in model dynamics with model parameters, it has also provided synthetic data for the inverse approach. In this second approach, we aim to distinguish between models from only the simulation output. For this purpose, I developed a tool that can qualitatively capture information from changes to a cell’s complex cytoskeleton. My tool is reasonably successful at distinguishing models by analyzing simulation data generated with different input parameters. This study systematically investigates the scope of this tool as a classification task, and indicates its potential use in analyzing experimental data. (TCPL 201) |
15:15 - 15:30 |
Wanxin Li: Using a Riemannian Elastic Metric for Statistical Analysis of Tumor Cell Shape Heterogeneity ↓ We examine how a specific instance of the elastic metric, the Square Root Velocity (SRV) metric, can be used to study and compare cellular morphologies from the contours they form on planar surfaces. We process a dataset of images from osteocarcoma (bone cancer) cells that includes different treatments known to affect the cell morphology, and perform a comparative statistical analysis between the linear and SRV metrics. Our study indicates superior performance of the SRV at capturing the cell shape heterogeneity, with a better separation between different cell groups when comparing their distance to their mean shape, as well as a better low dimensional representation when comparing stress statistics. Therefore, our study suggests the use of a Riemannian metric, such as the SRV as a potential tool to enhance morphological discrimination for large datasets of cancer cell images (TCPL 201) |
15:30 - 15:45 |
Clément Soubrier: End to end pipeline for cell shape dynamics analysis from atomic force microscopy time series ↓ Recent advances in coupling the nanoscale spatial resolution from Atomic Force Microscopy with continuous, long-term time-lapse imaging (LTTL-AFM) have allowed to study cell morphology dynamics at an unprecedented resolution. For mycobacteria, it has been shown that fundamental cellular processes (e.g., growth, division, and cell death) can be linked to specific shape patterns and phenotypes. However, quantifying the dynamics and biophysical heterogeneity of large population over time is still challenging due to the lack of automated tools and methods to systematically analyze time-series from LTTL-AFM.
In collaboration with HA Eskandarian (UCSF), we developed an end-to-end pipeline to quantify the biophysical heterogeneity of cells imaged by LTTL-AFM. This pipeline combines (1) machine learning-based methods for cell tracking and segmentation, (2) algorithms from computational geometry and image analysis for the detection of division events and lineage reconstruction, and (3) non-linear dimensionality reduction methods and metrics for statistical analysis. We applied our pipeline to LTTL-AFM imaging datasets of isogenic populations of M. smegmatis cells and evaluated various biophysical features such as elongation rate, relative pole-elongation rates, and surface morphology upon division-site selection. We characterized and identified, for large populations, key determinants of bacterial cell division, as well as the phenotypic impact of various stress conditions imposed by antibiotic treatment on the mycobacterial cell surface. (TCPL 201) |
15:45 - 16:00 |
Huangqingbo Sun: CellOrganizer: Learning Morphological, Spatial, and Dynamic Models for Cellular and Subcellular Components ↓ Illustrations found in textbooks showcasing cell structures consist of hand-drawn cartoons or single images that fail to encompass the diverse range of morphological variations present in these structures. In the realm of cell biology, a significant shift is occurring, transitioning from crude approximations of cell structure shapes and spatial arrangements to the development of precise and spatially faithful models of cellular components. This transformation is being driven by computational techniques that aim to produce automated, concise, and statistically sound generative models of cellular organization using extensive biological datasets. An example of this progress is CellOrganizer, an open-source system for using microscope images to learn statistical models of the structure of cell components and how those components are organized relative to each other. These models effectively encapsulate the statistical deviations in the arrangement of cellular and subcellular elements by concurrently representing the distributions of their quantities, shapes, and spatial placements. Such models can be juxtaposed, and tailored to different cell types or conditions, to highlight discrepancies in their spatial structures. As generative models, they possess the ability to synthesize novel instances of cells based on the knowledge assimilated by the model. This also furnishes cell geometry information essential for subsequent biochemical simulation studies. (TCPL 201) |
16:00 - 16:15 |
Chenwei Zhang ↓ TBA (TCPL 201) |
16:15 - 16:30 |
Amil Khan: Cell Geometry: A Web-based Application for Cell Shape Analysis ↓ We present Cell Geometry, an open-source web-based platform for large-scale morphological cell shape analysis. Specifically, we focus on automating 3D cell segmentation and shape analysis using geometric machine learning. The goal of this project is twofold. The first is to perform these tasks at scale, wherein users can upload several terabytes of data and have all the processing done on our platform. Once done, users can download all of the segmentations, outputs of shape analysis such as tangent PCA, and any graphs/visualizations. The second is to make these advanced techniques in deep learning and differential geometry more accessible to researchers with less of a software background, and those who do not have access to the hardware resources to accomplish terabyte-scale analysis. Our segmentation method, CellECT 2.0 from BisQue, employs a rotation equivariant 3D UNET for accurate 3D cell segmentation. The output from this analysis gives us the surface coordinates which is used for the 3D shape analysis. Here, we use geomstats, an open-source Python package for computations and statistics on manifolds, to perform either 2D or 3D shape analysis such as computing mean shape---an important metric to distinguish between different experimental conditions, i.e. Normal vs. Disease. Join us as we launch the platform into a Beta Production Release! (TCPL 201) |
17:30 - 19:30 |
Dinner ↓ A buffet dinner is served daily between 5:30pm and 7:30pm in Vistas Dining Room, top floor of the Sally Borden Building. (Vistas Dining Room) |
20:00 - 23:59 |
Party (TCPL Foyer) |