Page under construction
Home :: Selected Projects and Publications :: ::
::
Summary
This project introduces DynACD, a novel probabilistic machine learning model for detecting anomalies in dynamic networks.
DynACD integrates community detection to establish a baseline of typical network behavior,
identifying anomalies as deviations from this established structure. The model is evaluated on both synthetic and real-world datasets,
demonstrating superior performance compared to existing methods. A case study using Transfermarkt dataset, which contains information on player transfers
among European men’s football clubs, showcases DynACD’s ability
to detect anomalies influenced by factors like club wealth and national leagues, as well as identify potential data errors. The model’s
probabilistic framework allows for the interpretation of anomaly detection results, enhancing its practical usability.
(Commun Phys, , Nature Portfolio 7, 397 (2024)) (GitHub)
Figure 1: Genoa transfer network. Visualization of player transfers to and from Genoa involving various clubs at different time steps. There is a consistent presence of transfers with Juventus and Inter Milan at most time steps, T, shown by red edges.
Summary
This project developes a novel probabilistic generative model, called CRep, designed to analyze and model reciprocity and community structure within directed networks. Unlike previous models that treat these aspects separately or make simplifying assumptions about edge independence, CRep incorporates both community memberships and a reciprocity parameter to more accurately represent the complex interplay between these factors in real-world networks. The model uses an efficient expectation-maximization algorithm for parameter inference and a benchmark generative model for synthetic data creation. The effectiveness of CRep is demonstrated through experiments on both synthetic and real-world datasets, showing superior performance in edge prediction and reproducing observed reciprocity values. The authors provide an open-source implementation of their code. (Phys. Rev. Research 3, 023209 (2021)) (GitHub)
Figure 2: Reciprocity Patterns in the Erasmus Network. This figure illustrates how different universities in the Erasmus student exchange network exhibit a range of reciprocity patterns in their partnerships with other universities. The figure visualizes these patterns in terms of how the outgoing edges of a given university are driven by academic preference versus the reciprocation of student exchanges from partner universities.
Summary
This interdisciplinary project introduces VIMuRe, a new probabilistic model for analyzing noisy, multiply-reported social network data.
The model accounts for individual reporter biases (over- or under-reporting) and “mutuality”—the tendency to report reciprocal relationships.
VIMuRe’s efficiency allows it to scale to large networks, utilizing variational inference.
The model is validated through simulations and applied to two empirical datasets (Nicaragua and Karnataka),
demonstrating improved accuracy in estimating network structure and reciprocity compared to traditional methods.
The findings highlight the importance of considering mutuality and reporter reliability when interpreting social network data.
(Journal of the Royal Statistical Society Series A: Statistics in Society (186) 3, 355–375 (2023)) (GitHub)
Figure 3: Illustration of reporter reliability and tie confirmation in a social support network from a Nicaraguan community. Pie charts highlight different configurations of reported and confirmed ties for selected reporters, emphasizing variations in reporting behavior and mutuality.
Summary
This work presents a probabilistic generative model for anomaly detection in networks, ACD, leveraging community structure to define regular interaction patterns. The model incorporates latent variables representing community memberships and anomalous edges, enabling simultaneous inference of both. An expectation-maximization algorithm efficiently estimates these variables, and the model’s performance is validated using synthetic and real-world datasets. Experiments demonstrate improved community detection by removing or adding edges identified as anomalies, showcasing the model’s flexibility and robustness. The findings suggest that understanding network community structure enhances the accuracy of anomaly detection. (Journal of Big Data (9), 122 (2022)) (GitHub)
Figure 4: The network of POLBOOKS (Books about US politics). The plot shows enhanced community detection through the removal of anomalous edges (panel (b)). Red edges indicate those identified as anomalies and removed before reanalysis. Red rectangles denote pairs of nodes that are connected by the edges inferred as anomalous. Cyan rectangles demonstrate the nodes that changed their community membership, after removing the aforementioned anomalous edges. Cyan edges in panel (a) present the edges connected to these. Panel (c) indicates Ground Truth Communities.
Summary
The interdisciplinary research project developes a noise-driven differentiation
(NDD) model of cell differentiation and pattern formation. The model
incorporates intrinsic cellular noise, stochastic cell division, and cell
signaling to explain how phenotypic diversity emerges and leads to
spatiotemporal patterns in cell populations. Simulations using a cell
aggregation model support the model’s predictions, showing that noise alone
can generate heterogeneity, while signaling introduces spatial order. The
authors compare their model to existing models, highlighting its unique
approach of separating and analyzing different noise sources. Finally, the
study discusses the implications of the NDD model for understanding the
evolutionary transition from unicellularity to multicellularity.
(Plos one 15 (4), e0232060 (2020)) (GitHub)
Figure 5: In a bistable switch, two attractors (red semicircles) and, consequently, two phenotypes are available: A and B. The likelihood of a switch choosing state A over B depends on the number of the transcription factor associated with state A (TFX) relative to the number of the transcription factor associated with state B (TFY), as well as the noise in its environment.
Summary
This research paper uses computational modeling and game theory to investigate the Crabtree effect in yeast, focusing on the switch between fermentation and respiration for ATP production. The authors propose a metabolic switch model where individual yeast cells dynamically adjust their strategy based on glucose availability, resulting in population-level heterogeneity. This contrasts with traditional game-theoretical models that assume distinct subpopulations with fixed strategies. The study uses stochastic simulations to explore the model’s behavior under varying glucose conditions and proposes an experimental method to distinguish between mixed-strategy and distinct-subpopulation scenarios. The findings could have implications for understanding cancer metabolism and developing new therapies. (Journal of Theoretical Biology 531, 110912 (2021)) (GitHub)
Figure 6: The proposed regulatory network imagines a simple switch between fermentation and respiration based on the experimental data.