3. Data, Science & AI

24 skills

Found 9989 skills

Total Stars: 6.7M
Avg Stars: 667

omero-integration

davila7

18.0K

Provides Python-based access to microscopy data management, enabling image retrieval, pixel analysis, ROI annotation, and batch processing for high-content screening workflows.

Python
Microscopy
ROI

medchem

davila7

18.0K

Applies medicinal chemistry rules including Lipinski, Veber, and PAINS filters to prioritize compounds and filter chemical libraries.

Lipinski
Veber
PAINS

hypogenic

davila7

18.0K

Automates scientific hypothesis generation and testing by combining LLMs with empirical data and literature for research discovery.

LLMs
Hypothesis Testing
Empirical Data

pyhealth

davila7

18.0K

Comprehensive toolkit for building, testing, and deploying machine learning models using clinical data, healthcare datasets, and medical coding systems.

MIMIC-III
Deep Learning
ICD

clip

davila7

18.0K

Enables zero-shot image classification, image-text matching, and cross-modal retrieval using a vision-language model trained on 400M image-text pairs.

CLIP
Zero-shot
Cross-modal

arboreto

davila7

18.0K

Infers gene regulatory networks from transcriptomics data using scalable ML algorithms (GRNBoost2, GENIE3) for transcription factor-target gene analysis.

GRNBoost2
GENIE3
Transcriptomics

ena-database

davila7

18.0K

Accesses the European Nucleotide Archive (ENA) via API or FTP to retrieve DNA/RNA sequences, FASTQ files, and genome assemblies for bioinformatics analysis.

ENA
FASTQ
Genome Assembly

adaptyv

davila7

18.0K

Cloud platform automating protein testing and validation, including binding assays, expression testing, and AI-driven sequence optimization using ESM.

Protein sequence optimization
ESM
Wet-lab validation

senior-data-scientist

davila7

18.0K

Provides advanced data science capabilities including statistical modeling, A/B testing, and predictive analytics using Python, R, and SQL for data-driven decision making.

Python
A/B Testing
Statistical Modeling

string-database

davila7

18.0K

Queries the STRING API for protein-protein interactions, enabling network analysis and functional enrichment (GO/KEGG) across 5000+ species for systems biology research.

STRING
PPI
GO/KEGG

langchain

davila7

18.0K

Framework for building LLM-powered applications with agents, chains, RAG, and multi-provider support for chatbots and question-answering systems.

RAG
Agents
Vector Stores

clinpgx-database

davila7

18.0K

Provides access to the ClinPGx pharmacogenomics database for querying gene-drug interactions, CPIC guidelines, and allele functions to support precision medicine decisions.

ClinPGx
Pharmacogenomics
Gene-drug interactions

scientific-visualization

davila7

18.0K

Generates journal-ready scientific figures with multi-panel layouts, error bars, and colorblind-safe designs using matplotlib, seaborn, and plotly.

matplotlib
seaborn
plotly

dask

davila7

18.0K

Enables parallel and distributed computing to scale pandas and NumPy operations on datasets exceeding memory capacity.

Dask
Pandas
NumPy

clinvar-database

davila7

18.0K

Queries the ClinVar database for variant clinical significance, supporting gene/position searches and VCF annotation in genomic medicine.

ClinVar
VCF
E-utilities

langsmith-observability

davila7

18.0K

Enables tracing, evaluation, and monitoring of LLM applications for debugging, model testing, and production system oversight.

LLM Observability
Model Evaluation
AI Monitoring

excel-analysis

davila7

18.0K

Analyzes Excel spreadsheets, creates pivot tables, and generates charts for data visualization and tabular data analysis.

Excel
Pivot Tables
Data Visualization

aeon

davila7

18.0K

Provides scikit-learn compatible APIs for time series machine learning tasks including forecasting, anomaly detection, and clustering on temporal data.

Time Series
Machine Learning
scikit-learn

modal-serverless-gpu

davila7

18.0K

Serverless GPU platform for deploying ML models as APIs and running scalable batch jobs without infrastructure management.

Serverless
GPU
Machine Learning

biopython

davila7

18.0K

Primary Python toolkit for molecular biology data processing, including sequence manipulation, file parsing, and BLAST workflows.

Biopython
Sequence Analysis
BLAST

moe-training

davila7

18.0K

Trains Mixture of Experts (MoE) models efficiently using DeepSpeed or HuggingFace, reducing compute costs while enabling sparse architectures for large-scale AI systems.

MoE
DeepSpeed
HuggingFace

gtars

davila7

18.0K

High-performance Rust toolkit with Python bindings for genomic interval analysis, BED file processing, and ML tokenization in computational genomics.

Genomic intervals
BED files
Rust

dspy

davila7

18.0K

DSPy framework for building AI systems with declarative programming, automatic prompt optimization, and modular RAG pipelines.

DSPy
RAG
Prompt Optimization

blip-2-vision-language

davila7

18.0K

Provides a vision-language pre-training framework for image captioning, visual question answering, and multimodal chat with zero-shot capabilities.

BLIP-2
Vision-Language
Zero-shot