This is a live page with my latest reading notes for various scientific topics.
Data Mining
-
Ap-FSM, A parallel algorithm for approximate frequent subgraph mining using Pregel
-
ManIACS, Approximate Mining of Frequent Subgraph Patterns through Sampling
-
Pangolin, An efficient and flexible graph mining system on CPU and GPU
-
Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs
-
SLOSH, Set LOcality Sensitive Hashing via Sliced Wasserstein Embeddings
Cheminformatics
-
Learning Local Equivariant Representations for Large-Scale Atomistic Dynamics
-
Towards Effective and Generalizable Fine-Tuning for Pre-trained Molecular Graph Models
-
Self-Supervised Graph Transformer on Large-Scale Molecular Data
Bioinformatics
-
Protein Structure Comparison by Alignment of Distance Matrices
-
Learning Structural Motif Representations for Efficient Protein Structure Search
-
Frequent subgraph mining for biologically meaningful structural motifs
-
Structure-based function prediction using graph convolution networks
-
Hydrogen bonds meet self-attention, all you need for general-purpose protein structure embedding
-
Efficiently mining recurrent substructures from protein three-dimensional structure graphs
-
ProteinNet, a standardized data set for machine learning of protein structure
-
Efficient network-guided multi-locus association mapping with graph cuts