Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships
Florian Huber, Lars Ridder, Stefan Verhoeven, Jurriaan H. Spaaks, Faruk Diblen, Simon Rogers, Justin J.J. van der Hooft
2020-08-12
Jurriaan H. Spaaks
Spaaks Jurriaan H.
ORCID: 0000-0002-7064-4069
Mmsoda Toolbox For Matlab 10.5281/ZENODO.1435005
Mmsoda Toolbox For Matlab 10.5281/ZENODO.1064714
Netherlands Escience Center Software Sustainability Protocol 10.5281/zenodo.1451751
Netherlands Escience Center Software Sustainability Protocol 10.5281/zenodo.1451750
Structure From Motion 10.5281/ZENODO.1435774
Pattyanalytics 10.5281/ZENODO.1436472
Pattyanalytics 10.5281/ZENODO.597210
Pattyanalytics 10.5281/ZENODO.1436473
Structure From Motion 10.5281/ZENODO.594751
Painting The Picture Of Software Impact With The Research Software Directory 10.5281/ZENODO.1412293
Painting The Picture Of Software Impact With The Research Software Directory 10.5281/ZENODO.1412292
Nlesc/Xenon-Poster-Escience-Symposium-2017: Xenon Poster Presented At The National Escience Symposium 2017 10.5281/zenodo.1013063
Nlesc/Xenon-Poster-Escience-Symposium-2017: Xenon Poster Presented At The National Escience Symposium 2017 10.5281/zenodo.1013062
Jspaaks/Inverse-Modeling-2017: Third Release 10.5281/zenodo.1064599
Jspaaks/Inverse-Modeling-2017: Third Release 10.5281/zenodo.1064598
Nlesc/Esibayes: Version 1.0.2 10.5281/zenodo.1064715
Jspaaks/Ampersandbox: Testing Zenodo-Github Integration 10.5281/zenodo.1197168
Towards A Better Understanding Of Bird Migration 10.5281/zenodo.1064086
Towards A Better Understanding Of Bird Migration 10.5281/zenodo.1064085
Tutorial Distributed Computing With Xenon 1.1.0 10.5281/zenodo.1064100
Tutorial Distributed Computing With Xenon 1.1.0 10.5281/zenodo.1064099
Nlesc/Xenon-Cli: V1.1.0-Rc1 10.5281/zenodo.838671
Nlesc/Xenon-Cli: V1.1.0-Rc1 10.5281/zenodo.838990
Nlesc/Boatswain: Release 1.0.2 10.5281/zenodo.1149011
Nlesc/Boatswain: Release 1.0.2 10.5281/zenodo.1149010
Nlesc/Xenon-Cli: V2.0.0 10.5281/zenodo.1043245
Citation-File-Format/Cff-Converter-Python: 0.0.1 10.5281/zenodo.1228727
Nlesc/Xenon-Cli: V2.1.0 10.5281/zenodo.1174017
Citationcff/Citationcff: Citationcff V0.0.0 10.5281/zenodo.1162058
Nlesc/Xenon-Rse2017-Tutorial: Same As 1.0.0, Trying To Trigger Zenodo 10.5281/zenodo.1064516
Nlesc/Xenon-Rse2017-Tutorial: Same As 1.0.0, Trying To Trigger Zenodo 10.5281/zenodo.1064515
Nlesc/Xenon-Examples: Version 1.0.6 10.5281/zenodo.1064654
Nlesc/Xenon-Examples: Version 1.0.6 10.5281/zenodo.1064653
Structure-From-Motion: Structure From Motion Pipeline 10.5281/zenodo.45937
How the Netherlands eScience Center uses CFF to promote software citation 10.6084/m9.figshare.7053608
How the Netherlands eScience Center uses CFF to promote software citation 10.6084/m9.figshare.7053608.v1
Inverse Modeling For Improving Hydrological, Environmental And Ecological Models 10.5281/zenodo.1064106
Inverse Modeling For Improving Hydrological, Environmental And Ecological Models 10.5281/zenodo.1064105
Xenon Tutorial Rse 2017 10.5281/zenodo.1064674
Xenon Tutorial Rse 2017 10.5281/zenodo.1064673
Nlesc/Structure-From-Motion: 1.0.2 10.5281/zenodo.1051014
Research Software Directories 10.5281/zenodo.1215481
Research Software Directories 10.5281/zenodo.1215480
Pattyanalytics: 1.0.0 10.5281/zenodo.45925
Fair Software - Towards Fully Open Science And Software Impact 10.5281/zenodo.1221346
Fair Software - Towards Fully Open Science And Software Impact 10.5281/zenodo.1221345
Nlesc/Xenon: This Is Release 2.0.0 Of Xenon. 10.5281/zenodo.1036769
Nlesc/Xenon: This Is Release Candidate 1 Of Xenon 2.0.0. 10.5281/zenodo.838670
Nlesc/Xenon: This Is Release Candidate 1 Of Xenon 2.0.0. 10.5281/zenodo.838984
Research Software Directory 10.5281/zenodo.1305971
Research Software Directory 10.5281/zenodo.1154130
Research Software Directory - Putting The F In Fair Software 10.5281/ZENODO.1469649
Research Software Directory - Putting The F In Fair Software 10.5281/ZENODO.1469648
Painting The Picture Of Software Impact With The Research Software Directory 10.5281/zenodo.1477863
Painting The Picture Of Software Impact With The Research Software Directory 10.5281/zenodo.1477864
Make software findable with the Research Software Directory 10.5281/zenodo.2558482
Make software findable with the Research Software Directory 10.5281/zenodo.2558481
Research Software Directory 10.5281/zenodo.2609141
The Research Software Directory and how it promotes Findability, Citability, and Reproducibility of Research Software 10.5281/zenodo.3247441
The Research Software Directory and how it promotes Findability, Citability, and Reproducibility of Research Software 10.5281/zenodo.3247440
Citation File Format (CFF) - Specifications 10.5281/zenodo.1003149
Citation File Format (CFF) - Specifications 10.5281/zenodo.3515946
Five recommendations for FAIR software - hands-on session 10.5281/zenodo.3547878
Five recommendations for FAIR software - hands-on session 10.5281/zenodo.3547879
Research Software Directory 10.5281/zenodo.3631783
Xenon command line interface 10.5281/zenodo.3724030
Xenon command line interface 10.5281/zenodo.597603
Xenon gRPC server 10.5281/zenodo.3724051
Xenon gRPC server 10.5281/zenodo.1043481
presentation workshop fair-software 10.5281/zenodo.3859356
presentation workshop fair-software 10.5281/zenodo.3859355
Research Software Directory 10.5281/zenodo.3876805
Research Software Directory 10.5281/zenodo.3759734
Research Software Directory 10.5281/zenodo.3761184
Self-hosted runners 10.5281/zenodo.3904266
Self-hosted runners 10.5281/zenodo.3904265
Self-hosted runners 10.5281/zenodo.3904800
Report on the Workshop on Sustainable Software Sustainability 2019 (WOSSS19) 10.5281/zenodo.3922155
Report on the Workshop on Sustainable Software Sustainability 2019 (WOSSS19) 10.5281/zenodo.3922154
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1101/2020.08.11.245928
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1371/journal.pcbi.1008724
Netherlands eScience Center Python Template 10.5281/zenodo.1310751
Faruk Diblen
Diblen Faruk
ORCID: 0000-0002-0989-929X
matchms - processing and similarity evaluation of mass spectrometry data 10.1101/2020.08.06.239244
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1101/2020.08.11.245928
matchms - processing and similarity evaluation of mass spectrometry data. 10.21105/joss.02411
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1371/journal.pcbi.1008724
Florian Huber
Huber Florian
ORCID: 0000-0002-3535-9406
Cytoskeletal crosstalk: When three different personalities team up 10.1016/j.ceb.2014.10.005
Robust organizational principles of protrusive biopolymer networks in migrating living cells 10.1371/journal.pone.0014471
Self-regulative organization of the cytoskeleton 10.1002/cm.20509
Emergent complexity of the cytoskeleton: From single filaments to tissue 10.1080/00018732.2013.771509
Counterion-induced formation of regular actin bundle networks 10.1039/c1sm06019h
Actin-microtubule coordination at growing microtubule ends 10.1038/ncomms5778
In vitro reconstitution of dynamic microtubules interacting with actin filament networks 10.1016/B978-0-12-397924-7.00017-0
Growing actin networks form lamellipodium and lamellum by self-assembly 10.1529/biophysj.108.134817
Formation of regularly spaced networks as a general feature of actin bundle condensation by entropic forces 10.1088/1367-2630/17/4/043029
Actin droplet machine 10.1098/rsos.191135
GGIR: A Research Community–Driven Open Source R Package for Generating Physical Activity and Sleep Outcomes From Multi-Day Raw Accelerometer Data 10.1123/jmpb.2018-0063
Computing on actin bundles network 10.1038/s41598-019-51354-y
Actin networks voltage circuits 10.1103/PhysRevE.101.052314
matchms - processing and similarity evaluation of mass spectrometry data 10.1101/2020.08.06.239244
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1101/2020.08.11.245928
matchms - processing and similarity evaluation of mass spectrometry data. 10.21105/joss.02411
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1371/journal.pcbi.1008724
MS2DeepScore - a novel deep learning similarity measure for mass fragmentation spectrum comparisons 10.1101/2021.04.18.440324
Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches 10.1039/D1NP00023C
Simon Rogers
Rogers Simon
ORCID: 0000-0003-3578-4477
An Optimal Frequency in Ca2+ Oscillations for Stomatal Closure Is an Emergent Property of Ion Transport in Guard Cells 10.1104/pp.15.01607
Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets. 10.1093/bioinformatics/btv072
MetAssign: probabilistic annotation of metabolites from LC-MS data using a Bayesian clustering approach. 10.1093/bioinformatics/btu370
Probabilistic assignment of formulas to mass peaks in metabolomics experiments. 10.1093/bioinformatics/btn642
Investigating the correspondence between transcriptomic and proteomic expression profiles using coupled cluster models. 10.1093/bioinformatics/btn553
Bayesian model-based inference of transcription factor activity. 10.1186/1471-2105-8-S2-S2
Identification of prognostic signatures in breast cancer microarray data using Bayesian techniques. 10.1098/rsif.2005.0093
A Bayesian regression approach to the inference of regulatory networks from gene expression data. 10.1093/bioinformatics/bti487
The latent process decomposition of cDNA microarray data sets. 10.1109/TCBB.2005.29
HDP-Align: Hierarchical Dirichlet Process Clustering for Multiple Peak Alignment of Liquid Chromatography Mass Spectrometry Data 10.1101/074831
Topic modeling for untargeted substructure exploration in metabolomics 10.1073/pnas.1608041113
Global Sensitivity Analysis of OnGuard Models Identifies Key Hubs for Transport Interaction in Stomatal Dynamics 10.1104/pp.17.00170
Unexpected Connections between Humidity and Ion Transport Discovered Using a Model to Bridge Guard Cell-to-Leaf Scales 10.1105/tpc.17.00694
Minimally-destructive atmospheric ionisation mass spectrometry authenticates authorship of historical manuscripts 10.1038/s41598-018-28810-2
Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra 10.1101/491506
Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra 10.1039/C8FD00235E
MolNetEnhancer: enhanced molecular networks by integrating metabolome mining and annotation tools 10.1101/654459
EZ-Root-VIS: A Software Pipeline for the Rapid Analysis and Visual Reconstruction of Root System Architecture 10.1104/pp.18.00217
MolNetEnhancer: Enhanced Molecular Networks by Integrating Metabolome Mining and Annotation Tools 10.3390/metabo9070144
In-silico Optimisation of Mass Spectrometry Fragmentation Strategies in Metabolomics 10.1101/744227
In Silico Optimization of Mass Spectrometry Fragmentation Strategies in Metabolomics 10.3390/metabo9100219
Gestational-age-dependent development of the neonatal metabolome 10.1101/2020.03.27.20045534
Universal MS/MS Visualization and Retrieval with the Metabolomics Spectrum Resolver Web Service 10.1101/2020.05.09.086066
Predicting host taxonomic information from viral genomes: A comparison of feature representations 10.1371/journal.pcbi.1007894
Decomposing metabolite set activity levels with PALS 10.1101/2020.06.07.138974
Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions 10.1101/2020.06.12.148205
matchms - processing and similarity evaluation of mass spectrometry data 10.1101/2020.08.06.239244
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1101/2020.08.11.245928
Probabilistic Framework for Integration of Mass Spectrum and Retention Time Information in Small Molecule Identification 10.1101/2020.08.19.255653
matchms - processing and similarity evaluation of mass spectrometry data. 10.21105/joss.02411
Rapid Development of Improved Data-dependent Acquisition Strategies 10.1101/2020.09.11.293092
User guide to the red blood cell model (RCM), a multiplatform JAVA-based model of human red blood cell homeostasis 10.1101/2020.03.07.981779
Probabilistic framework for integration of mass spectrum and retention time information in small molecule identification 10.1093/bioinformatics/btaa998
Comparative Metabologenomics Analysis of Polar Actinomycetes 10.3390/md19020103
Ranking Metabolite Sets by Their Activity Levels 10.3390/metabo11020103
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1371/journal.pcbi.1008724
Up-down biphasic volume response of human red blood cells to PIEZO1 activation during capillary transits 10.1371/journal.pcbi.1008706
PIEZO1 and the mechanism of the long circulatory longevity of human red blood cells 10.1371/journal.pcbi.1008496
Rapid Development of Improved Data-Dependent Acquisition Strategies 10.1021/acs.analchem.0c03895
Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches 10.1039/D1NP00023C
Using topic modeling to detect cellular crosstalk in scRNA-seq 10.1101/2021.07.26.453767
Justin van der Hooft
van der Hooft Justin / Justin J.J. van der Hooft
ORCID: 0000-0002-9340-5511
Structural elucidation of low abundant metabolites in complex sample matrices 10.1007/s11306-013-0519-8
Non-smoky GLYCOSYLTRANSFERASE1 prevents the release of smoky aroma from tomato fruit 10.1105/tpc.113.114231
Automatic chemical structure annotation of an LC-MSn based metabolic profile from green tea 10.1021/ac400861a
Substructure-based annotation of high-resolution multistage MSn spectral trees 10.1002/rcm.6364
Structural annotation and elucidation of conjugated phenolic compounds in black, green, and white tea extracts 10.1021/jf300297y
Structural elucidation and quantification of phenolic conjugates present in human urine after tea intake 10.1021/ac3017339
Spectral trees as a robust annotation tool in LC-MS based metabolomics 10.1007/s11306-011-0363-7
Metabolite identification using automated comparison of high-resolution multistage mass spectral trees 10.1021/ac2034216
Inhibitory activity of plumbagin produced by Drosera intermedia on food spoilage fungi 10.1002/jsfa.5522
A strategy for fast structural elucidation of metabolites in small volume plant extracts using automated MS-guided LC-MS-SPE-NMR 10.1002/mrc.2833
Thermal diffusivity of periderm from tomatoes of different maturity stages as determined by the concept of the frequency-domain open photoacoustic cell 10.1063/1.3530735
Polyphenol identification based on systematic and robust high-resolution accurate mass spectrometry fragmentation 10.1021/ac102546x
Iridoid and caffeoyl phenylethanoid glycosides of the endangered carnivorous plant Pinguicula lusitanica L. (Lentibulariaceae) 10.1016/j.bse.2009.05.003
Topic modeling for untargeted substructure exploration in metabolomics 10.1073/pnas.1608041113
Propagating annotations of molecular networks using in silico fragmentation 10.1371/journal.pcbi.1006089
Did a plant-herbivore arms race drive chemical diversity in Euphorbia? 10.1101/323014
Automatic Compound Annotation from Mass Spectrometry Data Using MAGMa. 10.5702/massspectrometry.S0033
Interactions of black tea polyphenols with human gut microbiota: implications for gut and cardiovascular health 10.3945/ajcn.113.058263
Identification of a drimenol synthase and drimenol oxidase from Persicaria hydropiper, involved in the biosynthesis of insect deterrent drimanes 10.1111/tpj.13527
Enhanced Acylcarnitine Annotation in High-Resolution Mass Spectrometry Data: Fragmentation Analysis for the Classification and Annotation of Acylcarnitines 10.3389/fbioe.2015.00026
Rapid and sustained systemic circulation of conjugated gut microbial catabolites after single-dose black tea extract consumption 10.1021/pr5001253
In Silico Prediction and Automatic LC–MSn Annotation of Green Tea Metabolites in Urine 10.1021/ac403875b
Unexpected differential metabolic responses of Campylobacter jejuni to the abundant presence of glutamate and fucose 10.1007/s11306-018-1438-5
Comprehensive mass spectrometry-guided plant specialized metabolite phenotyping reveals metabolic diversity in the cosmopolitan plant family Rhamnaceae 10.1101/463620
Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra 10.1101/491506
Accelerating metabolite identification in natural product research: toward an ideal combination of LC-HRMS/MS and NMR profiling, in silico databases and chemometrics 10.1021/acs.analchem.8b05112
Untargeted Mass Spectrometry-Based Metabolomics Tracks Molecular Changes in Raw and Processed Foods and Beverages 10.1101/347716
Absorption, metabolism, distribution and excretion of (−)-epicatechin: A review of recent findings 10.1016/j.mam.2017.11.002
Metabolite Identification in Complex Mixtures Using Nuclear Magnetic Resonance Spectroscopy 10.1007/978-3-319-28275-6_6-2
Ms2lda.org: web-based topic modelling for substructure discovery in mass spectrometry 10.1093/bioinformatics/btx582
A comprehensive evaluation of the [2-14C](–)-epicatechin metabolome in rats 10.1016/j.freeradbiomed.2016.08.001
The food metabolome: a window over dietary exposure 10.3945/ajcn.113.076133
Human studies on the absorption, distribution, metabolism, and excretion of tea polyphenols 10.3945/ajcn.113.058958
Unsupervised Discovery and Comparison of Structural Families Across Multiple Samples in Untargeted Metabolomics 10.1021/acs.analchem.7b01391
A community-driven paired data platform to accelerate natural product mining by combining structural information from genomes and metabolomes 10.18174/FAIRdata2018.16286
Implementations of the chemical structural and compositional similarity metric in R and Python 10.1101/546150
Comprehensive mass spectrometry-guided phenotyping of plant specialized metabolites reveals metabolic diversity in the cosmopolitan plant family Rhamnaceae 10.1111/tpj.14292
A genetical metabolomics approach for bioprospecting plant biosynthetic gene clusters 10.1186/s13104-019-4222-3
The Metabolomics Society—Current State of the Membership and Future Directions 10.3390/metabo9050089
Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra 10.1039/C8FD00235E
MolNetEnhancer: enhanced molecular networks by integrating metabolome mining and annotation tools 10.1101/654459
Substantial extracellular metabolic differences found between phylogenetically closely related probiotic and pathogenic strains of Escherichia coli 10.3389/fmicb.2019.00252
Assessing specialized metabolite diversity in the cosmopolitan plant genus Euphorbia L. 10.3389/fpls.2019.00846
MolNetEnhancer: Enhanced molecular networks by integrating metabolome mining and annotation tools 10.3390/metabo9070144
In Silico Optimization of Mass Spectrometry Fragmentation Strategies in Metabolomics 10.3390/metabo9100219
MIBiG 2.0: a repository for biosynthetic gene clusters of known function 10.1093/nar/gkz882
Untargeted mass spectrometry-based metabolomics approach unveils molecular changes in raw and processed foods and beverages 10.1016/j.foodchem.2019.125290
The Natural Products Atlas: An Open Access Knowledge Base for Microbial Natural Products Discovery 10.1021/acscentsci.9b00806
Mass spectrometry searches using MASST 10.1038/s41587-019-0375-9
Assessing specialized metabolite diversity of Alnus species by a digitized LC–MS/MS data analysis workflow 10.1016/j.phytochem.2020.112292
Microbiome-derived carnitine mimics as previously unknown mediators of gut-brain axis communication 10.1126/sciadv.aax6328
Chemically-informed Analyses of Metabolomics Mass Spectrometry Data with Qemistree 10.1101/2020.05.04.077636
Linking genomics and metabolomics to chart specialized metabolic diversity 10.1039/D0CS00162G
Reproducible molecular networking of untargeted mass spectrometry data using GNPS 10.1038/s41596-020-0317-5
Decomposing metabolite set activity levels with PALS 10.1101/2020.06.07.138974
Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions 10.1101/2020.06.12.148205
Phenylpropane as an Alternative Dearomatizing Unit of Indoles: Discovery of Inaequalisines A and B Using Substructure-Informed Molecular Networking 10.1021/acs.orglett.0c02153
matchms - processing and similarity evaluation of mass spectrometry data 10.1101/2020.08.06.239244
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1101/2020.08.11.245928
BiG-SLiCE: A Highly Scalable Tool Maps the Diversity of 1.2 Million Biosynthetic Gene Clusters 10.1101/2020.08.17.240838
matchms - processing and similarity evaluation of mass spectrometry data. 10.21105/joss.02411
ReDU: a framework to find and reanalyze public mass spectrometry data 10.1038/s41592-020-0916-7
Feature-based molecular networking in the GNPS analysis environment 10.1038/s41592-020-0933-6
Rapid Development of Improved Data-dependent Acquisition Strategies 10.1101/2020.09.11.293092
Chemically informed analyses of metabolomics mass spectrometry data with Qemistree 10.1038/s41589-020-00677-3
Auto-deconvolution and molecular networking of gas chromatography–mass spectrometry data 10.1038/s41587-020-0700-3
The ant fungus garden acts as an external digestive system 10.1101/2020.11.18.389361
BiG-MAP: an automated pipeline to profile metabolic gene cluster abundance and expression in microbiomes 10.1101/2020.12.14.422671
BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters 10.1093/gigascience/giaa154
Veterinary trypanocidal benzoxaboroles are peptidase-activated prodrugs 10.1371/journal.ppat.1008932
Comparative Metabologenomics Analysis of Polar Actinomycetes 10.3390/md19020103
Ranking Metabolite Sets by Their Activity Levels 10.3390/metabo11020103
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships 10.1371/journal.pcbi.1008724
Rapid Development of Improved Data-Dependent Acquisition Strategies 10.1021/acs.analchem.0c03895
MS2DeepScore - a novel deep learning similarity measure for mass fragmentation spectrum comparisons 10.1101/2021.04.18.440324
Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions 10.1371/journal.pcbi.1008920
Metabolomics-Guided Elucidation of Plant Abiotic Stress Responses in the 4IR Era: An Overview 10.3390/metabo11070445
Chemical Gradients of Plant Substrates in an Atta texana Fungus Garden 10.1128/mSystems.00601-21
Comprehensive Large-Scale Integrative Analysis of Omics Data To Accelerate Specialized Metabolite Discovery 10.1128/mSystems.00726-21