This is Python training and testing code for Locally Optimized Product Quantization (LOPQ) models, as well as Spark scripts to scale training to hundreds of millions of vectors. The resulting model ...
Automatic detection of macromolecular complexes is an open and challenging problem in cellular cryoelectron tomography. Existing computational methods rely on known structural templates or manually ...
The aim of this project is to aggregate, polish, and standardise the existing clustering benchmark batteries referred to across the machine learning and data mining literature, and to introduce new ...
Haplotype identification, characterization and visualization are important for large-scale analysis and use in population genomics. Many tools have been developed to visualize haplotypes, but it is ...
Here we developed an open-source Python-based library called Python rodent Analysis and Tracking (PyRAT). Our library analyzes tracking data to classify distinct behaviors, estimate traveled distance, ...
Natural phenomena are teeming with temporal complexity, but such dynamics, however fascinating, pose substantial obstacles to quantitative understanding. We introduce a general method based on the ...