3. Data, Science & AI
Found 9989 skills
postgres-schema-design
davila7
Comprehensive reference for designing PostgreSQL database schemas, including data types, indexing, constraints, and performance patterns.
clinical-decision-support
davila7
Generates evidence-based clinical decision support documents with statistical analysis, biomarker stratification, and regulatory compliance for pharmaceutical research.
cosmic-database
davila7
Provides programmatic access to COSMIC database for querying cancer mutations, gene fusions, and mutational signatures in precision oncology research.
sympy
davila7
Enables exact symbolic computation in Python for algebra, calculus, equations, and mathematical expressions without numerical approximation.
llamaindex
davila7
Data framework for building LLM applications with RAG, featuring document ingestion, indexing, and querying for knowledge retrieval systems.
whisper
davila7
AI-powered speech-to-text model for multilingual transcription, translation, and language identification.
matplotlib
davila7
Creates scientific visualizations including line plots, bar charts, heatmaps, and 3D graphs for publication-ready figures in multiple formats.
vaex
davila7
Enables efficient out-of-core processing and analysis of massive tabular datasets (billions of rows) with lazy evaluation and visualization.
davila7
Programmatic PDF toolkit for extracting text and tables, creating, merging, splitting, and filling forms to enable document data analysis.
geopandas
davila7
Python library for geospatial vector data analysis, enabling spatial operations, coordinate transformations, and map visualization with common formats.
audiocraft-audio-generation
davila7
Generates music and sound effects from text descriptions using AI models (MusicGen and AudioGen) built on PyTorch.
molfeat
davila7
Provides molecular featurization tools for machine learning, including ECFP, MACCS, and ChemBERTa, to convert SMILES into features for QSAR and molecular ML.
flowio
davila7
Processes FCS flow cytometry files, extracting events and metadata into NumPy arrays and DataFrames for scientific data preprocessing.
senior-ml-engineer
davila7
Enables production deployment, monitoring, and scaling of ML models with MLOps, RAG systems, and LLM integration for enterprise AI solutions.
gguf-quantization
davila7
Optimizes AI model inference on CPU/GPU by quantizing models to 2-8 bits using GGUF format and llama.cpp, enabling deployment on consumer hardware.
perplexity-search
davila7
Enables real-time AI-powered web searches using Perplexity models for current information, scientific literature, and source-cited answers via OpenRouter API.
fda-database
davila7
Provides API access to openFDA data for drugs, devices, adverse events, recalls, and regulatory submissions to support safety research and analysis.
gene-database
davila7
Queries NCBI Gene database via E-utilities/Datasets API to retrieve gene info (RefSeqs, GO, locations, phenotypes) for annotation and functional analysis.
pymoo
davila7
Provides a multi-objective optimization framework with algorithms like NSGA-II and MOEA/D for engineering design and optimization problems.
nemo-guardrails
davila7
Runtime safety framework for LLM applications with jailbreak detection, hallucination prevention, and PII filtering using Colang 2.0 DSL.
modal
davila7
Deploys and scales machine learning models and compute-intensive Python workloads in the cloud with GPU acceleration and auto-scaling.
umap-learn
davila7
Provides fast nonlinear dimensionality reduction using UMAP for 2D/3D visualization and clustering preprocessing of high-dimensional data.
implementing-llms-litgpt
davila7
Enables clean, single-file LLM training and fine-tuning with LitGPT, supporting LoRA/QLoRA for educational and production use.
model-pruning
davila7
Reduces LLM size and accelerates inference using pruning techniques like Wanda and SparseGPT, achieving 50% sparsity with minimal accuracy loss.