3. Data, Science & AI

24 skills

Found 9989 skills

Total Stars:6.7M
Avg Stars:667

pylabrobot

davila7

18.0K

Automates laboratory workflows by controlling liquid handlers, plate readers, and lab equipment, supporting both simulation and physical hardware execution.

Lab Automation
Liquid Handling
Plate Readers
3. Data, Science & AI

datacommons-client

davila7

18.0K

Provides programmatic access to Data Commons for querying global statistical data including demographics, economics, health, and environmental metrics.

Data Commons
API
Statistical Data
3. Data, Science & AI

opentargets-database

davila7

18.0K

Queries Open Targets Platform for target-disease associations, drug target data, and therapeutic evidence to support biomedical research and drug discovery.

Open Targets
Target-Disease Associations
Drug Discovery
3. Data, Science & AI

stable-diffusion-image-generation

davila7

18.0K

Generates images from text prompts using Stable Diffusion models via HuggingFace Diffusers.

Stable Diffusion
HuggingFace Diffusers
Text-to-image
3. Data, Science & AI

pyvene-interventions

davila7

18.0K

Guides causal interventions on PyTorch models using the pyvene framework for causal tracing, activation patching, and hypothesis testing.

PyTorch
Causal Interventions
Pyvene
3. Data, Science & AI

pathml

davila7

18.0K

Computational pathology toolkit for AI-driven analysis of whole-slide images, multiplex imaging, and spatial proteomics data in medical workflows.

Whole-slide imaging
Multiplex immunofluorescence
Spatial proteomics
3. Data, Science & AI

ray-train

davila7

18.0K

Orchestrates distributed training for PyTorch, TensorFlow, and HuggingFace models across clusters, supporting hyperparameter tuning and scaling to thousands of nodes.

Ray
Distributed Training
Hyperparameter Tuning
3. Data, Science & AI

huggingface-tokenizers

davila7

18.0K

High-performance NLP tokenization supporting BPE, WordPiece, and Unigram. Integrates with Hugging Face Transformers for research and production.

Tokenization
BPE
Transformers
3. Data, Science & AI

ensembl-database

davila7

18.0K

Provides programmatic access to Ensembl's REST API for genomic data retrieval, including gene, sequence, and variant analysis across multiple species.

Ensembl
REST API
Genomic Data
3. Data, Science & AI

pymatgen

davila7

18.0K

Computational materials science toolkit for crystal structure analysis, band structure calculations, and Materials Project integration.

Crystal Structures
Band Structure
Materials Project
3. Data, Science & AI

etetoolkit

davila7

18.0K

Toolkit for phylogenetic tree manipulation, evolutionary event detection, and visualization using NCBI taxonomy in phylogenomics research.

Phylogenetics
Newick
NCBI Taxonomy
3. Data, Science & AI

denario

davila7

18.0K

Automates scientific research workflows from data analysis to LaTeX publication via multiagent AI, supporting methodology development and literature searches.

Multiagent AI
Research Automation
LaTeX
3. Data, Science & AI

lambda-labs-gpu-cloud

davila7

18.0K

Offers on-demand and reserved GPU cloud instances for ML training/inference with SSH access, persistent storage, and multi-node clusters.

GPU Cloud
Machine Learning
Multi-node Clusters
3. Data, Science & AI

gwas-database

davila7

18.0K

Queries NHGRI-EBI GWAS Catalog for SNP-trait associations, p-values, and summary statistics to support genetic epidemiology and polygenic risk score analysis.

GWAS
SNP
Polygenic Risk Scores
3. Data, Science & AI

geo-database

davila7

18.0K

Accesses NCBI GEO database to search, download, and retrieve gene expression datasets (microarray/RNA-seq) in SOFT/Matrix formats for transcriptomics analysis.

NCBI GEO
Gene Expression
Transcriptomics
3. Data, Science & AI

latchbio-integration

davila7

18.0K

Builds and deploys bioinformatics workflows with Latch SDK, Nextflow/Snakemake integration, and LatchFile/LatchDir data handling.

Latch SDK
Nextflow
Snakemake
3. Data, Science & AI

pufferlib

davila7

18.0K

Provides tools for reinforcement learning development, including PPO training, custom environment creation, and integration with Gymnasium and PettingZoo.

PPO
Gymnasium
Multi-agent
3. Data, Science & AI

scikit-bio

davila7

18.0K

Provides tools for biological sequence analysis, phylogenetic tree construction, and microbiome diversity metrics including UniFrac and PERMANOVA.

Phylogenetics
UniFrac
PERMANOVA
3. Data, Science & AI

pydicom

davila7

18.0K

Python library for reading, writing, and processing DICOM medical imaging files, including metadata, pixel data, and anonymization.

DICOM
Medical Imaging
Image Processing
3. Data, Science & AI

quantizing-models-bitsandbytes

davila7

18.0K

Quantizes LLMs to 8-bit or 4-bit, reducing memory usage by 50-75% with minimal accuracy loss for efficient inference.

Quantization
HuggingFace
QLoRA
3. Data, Science & AI

evaluating-llms-harness

davila7

18.0K

Evaluates large language models against 60+ academic benchmarks, supporting industry-standard tools for model comparison and research reporting.

LLM Evaluation
HuggingFace
vLLM
3. Data, Science & AI

phoenix-observability

davila7

18.0K

Open-source platform for tracing, evaluating, and monitoring LLM applications to debug, assess performance, and gain real-time production insights.

LLM Tracing
Model Evaluation
AI Monitoring
3. Data, Science & AI

chroma

davila7

18.0K

Open-source database for storing AI embeddings and metadata, enabling vector search and RAG applications with a simple API.

Embeddings
Vector Search
RAG
3. Data, Science & AI

sentence-transformers

davila7

18.0K

Framework for generating high-quality text and image embeddings using pre-trained models, optimized for semantic search, RAG, and similarity tasks in production environments.

sentence-transformers
embeddings
RAG
3. Data, Science & AI
PreviousPage 9 of 417 PageNext