3. Data, Science & AI
Found 9989 skills
pymc-bayesian-modeling
davila7
Enables Bayesian inference with PyMC, including hierarchical models, MCMC (NUTS), and model comparison techniques for probabilistic programming.
citation-management
davila7
Comprehensive citation management tool for academic research, enabling search, metadata extraction, citation validation, and BibTeX generation from sources like Google Scholar and PubMed.
tensorboard
davila7
Visualizes ML training metrics, model graphs, and performance for debugging and experiment comparison using Google's TensorBoard.
huggingface-accelerate
davila7
Unifies distributed training frameworks for PyTorch with minimal code changes, automatic device placement, and mixed precision support.
datamol
davila7
Simplifies RDKit for drug discovery tasks including SMILES parsing, molecular descriptors, and 3D conformer generation.
knowledge-distillation
davila7
Compresses large language models via knowledge distillation, retaining most of the teacher model's performance while reducing inference costs. Supports soft-target and logit distillation techniques.
neuropixels-analysis
davila7
Analyzes Neuropixels neural recordings including preprocessing, spike sorting, and AI-assisted quality assessment for extracellular electrophysiology data.
protocolsio-integration
davila7
Integrates with the protocols.io API to manage scientific protocols, including creation, updates, collaboration, and documentation for lab and research workflows.
histolab
davila7
Toolkit for processing whole slide pathology images, including tile extraction, tissue segmentation, and dataset preparation for deep learning in computational pathology.
lamindb
davila7
Manages biological datasets according to FAIR principles, ensuring traceability, reproducibility, and integration with scientific workflows and MLOps platforms.
scvi-tools
davila7
Enables probabilistic modeling and analysis of single-cell omics data, including scRNA-seq, scATAC-seq, and spatial transcriptomics, for tasks like batch correction and cell type annotation.
pyopenms
davila7
Python interface for mass spectrometry data analysis, enabling proteomics and metabolomics workflows with file handling and quantitative processing.
pytorch-lightning
davila7
High-level PyTorch framework simplifying training loops with distributed computing, callbacks, and minimal boilerplate for scalable machine learning development.
neurokit2
davila7
Comprehensive toolkit for processing and analyzing physiological signals including EEG, ECG, and EDA for scientific research and clinical applications.
llama-cpp
davila7
Runs LLM inference on CPUs, Apple Silicon, and consumer GPUs without requiring NVIDIA hardware, using GGUF quantized models for efficiency.
gget
davila7
Provides rapid bioinformatics queries via CLI and Python, accessing multiple biological databases for sequence analysis and research.
pytorch-fsdp
davila7
Expert guidance for PyTorch FSDP training with parameter sharding, mixed precision, and CPU offloading in distributed deep learning.
senior-data-engineer
davila7
Builds scalable data pipelines, ETL/ELT systems, and data infrastructure using Spark, Airflow, and dbt for data modeling and orchestration.
long-context
davila7
Extends transformer model context windows using RoPE, YaRN, ALiBi, and position interpolation for processing long documents (32k-128k+ tokens).
stable-baselines3
davila7
Provides a library for training and experimenting with reinforcement learning agents using algorithms like PPO and SAC in Gym environments.
cirq
davila7
Framework for building, simulating, and executing quantum circuits, supporting quantum algorithms and hardware integration.
pdf-processing-pro
davila7
Provides production-grade PDF processing for forms, tables, OCR, and batch operations with robust validation and error handling.
cocoindex
davila7
Comprehensive toolkit for building AI data pipelines, including ETL workflows, vector embeddings, and knowledge graphs, via the CocoIndex library.
deeptools
davila7
Provides genomic data analysis and visualization for NGS workflows, including BAM-to-bigWig conversion, QC metrics, and heatmaps for ChIP-seq, RNA-seq, and ATAC-seq.