Machine Learning Intern
About Us
Isospec Analytics SA seeks an intern to develop cutting-edge embedding techniques for glycomics data. We aim to leverage our proprietary CIRIS technology to create powerful new representations of small molecules structures and their associated data.
Your Impact
Your primary focus will be on developing novel embedding models for LC-IMS-IR-MS data, with the goal of creating versatile representations that can be applied to a wide range of downstream tasks in glycobiology. This project sits at the exciting intersection of artificial intelligence and glycoscience.
What You Will Do
- Design and implement innovative embedding techniques for LC-IMS-IR-MS spectral data
- Explore unsupervised and self-supervised learning approaches for pre-training embeddings
- Investigate the potential of these embeddings for various applications, such as:
- Glycan structure prediction and classification
- Biological pathway inference
- Ontology enrichment and refinement
- Integration with existing glycan databases and knowledge bases
- Develop methods to incorporate domain knowledge into embedding models
- Explore cross-modal embeddings that combine spectral data with textual information from scientific literature
- Utilize advanced AI architectures (e.g., transformers, graph neural networks) to create robust embeddings
- Implement and adapt techniques from NLP and computer vision to spectral data analysis
- Develop strategies for efficient fine-tuning of pre-trained embeddings for specific glycobiology tasks
- Investigate transfer learning potential between different types of spectral data
- Integrate embedding models with public glycan databases (e.g., GlyTouCan, GLYCOSCIENCES.de)
- Develop tools to enrich glycan ontologies using the created embeddings
- Explore the use of Large Language Models (LLMs) to extract glycan-related information from scientific literature
- Investigate methods to improve data interoperability in glycomics research
Skills and Qualifications
Essentials
- Strong Python programming skills
- Demonstrated experience with AI/ML projects: show us how did you tackle a problem you had at hand with ML and what tools/frameworks you have used to deliver and evaluate an up and running solution
- Excellent communicator
- Critical scientific thinking. Apprehend problem from first principles and dissect the delivery in incremental steps with quantifiable results.
Nice To Have
- Implemented embedding techniques models and latent space search
- Experience with spectral data analysis and understanding of bioinformatics concept
- Worked with spectroscopy and mass spectrometry data
- Interdisciplinary research bridging AI, chemistry, and biology
Culture and Perks
At Isospec, you'll work in a dynamic setting where your innovations can directly impact small molecule research. We offer:
- Opportunity to implement AI system from scratch and deploy in production with real impact
- Experience fast paced environment. Level up coding pace to meet start up dynamic culture
- Autonomy and independence in approaches and design decisions of your systems
Also if you are interested in the startup ecosystem, you will get a chance to get exposed to all facets of it and contribute to the success of the company.