3. Data, Science & AI
Found 9989 skills
learn-from-pr
dotnet
Analyzes completed PRs with agent involvement to extract behavioral lessons, identify patterns, and generate actionable recommendations for improving agent skills and documentation.
domain-ml
rustfs
Enables development of machine learning and AI applications in Rust, supporting model training, inference, and deep learning with Rust libraries.
snowflake-semanticview
github
Creates, alters, and validates Snowflake semantic views using Snowflake CLI, including DDL validation and setup guidance.
model-merging
davila7
Merges fine-tuned AI models using techniques like SLERP and Task Arithmetic to combine domain expertise without retraining, enhancing performance and enabling rapid experimentation.
shap
davila7
Provides SHAP-based model interpretability for explaining predictions, feature importance, and bias analysis across ML models.
statistical-analysis
davila7
Comprehensive statistical analysis toolkit for academic research, including hypothesis testing, regression, Bayesian methods, and APA reporting.
pubchem-database
davila7
Queries PubChem database for chemical compounds, supporting searches by name, CID, SMILES, and retrieving properties, bioactivity, and similarity data.
biomni
davila7
Autonomous AI framework for biomedical research, executing complex tasks in genomics, drug discovery, and clinical analysis using LLM reasoning and integrated databases.
esm
davila7
Comprehensive toolkit for protein language models (ESM3, ESM C) enabling sequence, structure, and function prediction, design, and engineering tasks.
hqq-quantization
davila7
Enables 4/3/2-bit quantization of LLMs without calibration data, accelerating deployment via vLLM and HuggingFace Transformers.
simpo-training
davila7
Provides a reference-free, efficient alternative to DPO for LLM preference alignment, achieving better performance with simpler training.
torchdrug
davila7
PyTorch-based toolkit for biomedical graph machine learning, featuring GNNs for molecular property prediction, protein modeling, and drug discovery tasks.
kegg-database
davila7
Direct REST API access to KEGG database for academic research, enabling pathway analysis, gene mapping, and metabolic pathway exploration.
scikit-learn
davila7
A Python library for machine learning tasks including classification, regression, clustering, and model evaluation.
hmdb-database
davila7
Accesses Human Metabolome Database for searching metabolites, retrieving chemical properties, spectra, and pathways to support metabolomics research.
evaluating-code-models
davila7
Evaluates code generation models across HumanEval, MBPP, and MultiPL-E benchmarks using pass@k metrics for quality assessment.
pinecone
davila7
Provides a fully managed vector database for production AI applications, supporting RAG, semantic search, and scalable recommendation systems with low latency.
sglang
davila7
Accelerates LLM inference with RadixAttention prefix caching for structured JSON/regex outputs, constrained decoding, and agentic workflows.
polars
davila7
High-performance data manipulation library using Apache Arrow for efficient filtering, grouping, and I/O operations in data analysis workflows.
openrlhf-training
davila7
High-performance RLHF framework for training large language models (7B-70B+) using Ray, vLLM, and ZeRO-3, supporting PPO, DPO, and distributed training.
awq-quantization
davila7
Provides activation-aware weight quantization for 4-bit LLM compression, enabling 3x speedup with minimal accuracy loss on limited GPU memory.
rwkv-architecture
davila7
Provides an RNN-Transformer hybrid architecture for efficient AI inference with linear time complexity, infinite context, and production-ready large language models.
networkx
davila7
Comprehensive Python toolkit for creating, analyzing, and visualizing complex networks and graphs with graph algorithms and community detection capabilities.
constitutional-ai
davila7
Provides constitutional AI training for aligning language models with human values using self-critique and AI feedback to reduce harmful outputs without human labels.