3. Data, Science & AI

24 skills

Found 9989 skills

Total Stars:6.7M
Avg Stars:667

postgres-schema-design

davila7

18.0K

Comprehensive reference for designing PostgreSQL database schemas, including data types, indexing, constraints, and performance patterns.

PostgreSQL
Schema Design
Indexing
3. Data, Science & AI

clinical-decision-support

davila7

18.0K

Generates evidence-based clinical decision support documents with statistical analysis, biomarker stratification, and regulatory compliance for pharmaceutical research.

GRADE
Biomarker Stratification
Survival Curves
3. Data, Science & AI

cosmic-database

davila7

18.0K

Provides programmatic access to COSMIC database for querying cancer mutations, gene fusions, and mutational signatures in precision oncology research.

COSMIC
Cancer Genomics
Somatic Mutations
3. Data, Science & AI

sympy

davila7

18.0K

Enables exact symbolic computation in Python for algebra, calculus, equations, and mathematical expressions without numerical approximation.

SymPy
Symbolic Computation
Algebraic Manipulation
3. Data, Science & AI

llamaindex

davila7

18.0K

Data framework for building LLM applications with RAG, featuring document ingestion, indexing, and querying for knowledge retrieval systems.

RAG
Vector Indices
LLM
3. Data, Science & AI

whisper

davila7

18.0K

AI-powered speech-to-text model for multilingual transcription, translation, and language identification.

Whisper
Speech Recognition
Multilingual
3. Data, Science & AI

matplotlib

davila7

18.0K

Creates scientific visualizations including line plots, bar charts, heatmaps, and 3D graphs for publication-ready figures in multiple formats.

Matplotlib
Scientific Visualization
Publication Figures
3. Data, Science & AI

vaex

davila7

18.0K

Enables efficient out-of-core processing and analysis of massive tabular datasets (billions of rows) with lazy evaluation and visualization.

Vaex
Out-of-core
Big data
3. Data, Science & AI

pdf

davila7

18.0K

Programmatic PDF toolkit for extracting text and tables, creating, merging, splitting, and filling forms to enable document data analysis.

PDF
Text Extraction
Document Processing
3. Data, Science & AI

geopandas

davila7

18.0K

Python library for geospatial vector data analysis, enabling spatial operations, coordinate transformations, and map visualization with common formats.

geopandas
Spatial Analysis
Vector Data
3. Data, Science & AI

audiocraft-audio-generation

davila7

18.0K

Generates music and sound effects from text descriptions using AI models (MusicGen and AudioGen) built on PyTorch.

PyTorch
MusicGen
AudioGen
3. Data, Science & AI

molfeat

davila7

18.0K

Provides molecular featurization tools for machine learning, including ECFP, MACCS, and ChemBERTa, to convert SMILES into features for QSAR and molecular ML.

ECFP
SMILES
ChemBERTa
3. Data, Science & AI

flowio

davila7

18.0K

Processes FCS flow cytometry files, extracting events and metadata into NumPy arrays and DataFrames for scientific data preprocessing.

FCS
NumPy
DataFrame
3. Data, Science & AI

senior-ml-engineer

davila7

18.0K

Enables production deployment, monitoring, and scaling of ML models with MLOps, RAG systems, and LLM integration for enterprise AI solutions.

MLOps
RAG
LLM
3. Data, Science & AI

gguf-quantization

davila7

18.0K

Optimizes AI model inference on CPU/GPU by quantizing models to 2-8 bits using GGUF format and llama.cpp, enabling deployment on consumer hardware.

GGUF
llama.cpp
Quantization
3. Data, Science & AI

perplexity-search

davila7

18.0K

Enables real-time AI-powered web searches using Perplexity models for current information, scientific literature, and source-cited answers via OpenRouter API.

Perplexity
OpenRouter
RAG
3. Data, Science & AI

fda-database

davila7

18.0K

Provides API access to openFDA data for drugs, devices, adverse events, recalls, and regulatory submissions to support safety research and analysis.

openFDA
UNII
510k
3. Data, Science & AI

gene-database

davila7

18.0K

Queries NCBI Gene database via E-utilities/Datasets API to retrieve gene info (RefSeqs, GO, locations, phenotypes) for annotation and functional analysis.

NCBI Gene
E-utilities
Gene Ontology
3. Data, Science & AI

pymoo

davila7

18.0K

Provides a multi-objective optimization framework with algorithms like NSGA-II and MOEA/D for engineering design and optimization problems.

NSGA-II
MOEA/D
Pareto fronts
3. Data, Science & AI

nemo-guardrails

davila7

18.0K

Runtime safety framework for LLM applications with jailbreak detection, hallucination prevention, and PII filtering using Colang 2.0 DSL.

LLM Safety
Colang 2.0
Hallucination Detection
3. Data, Science & AI

modal

davila7

18.0K

Deploys and scales machine learning models and compute-intensive Python workloads in the cloud with GPU acceleration and auto-scaling.

GPU
Auto-scaling
ML
3. Data, Science & AI

umap-learn

davila7

18.0K

Provides fast nonlinear dimensionality reduction using UMAP for 2D/3D visualization and clustering preprocessing of high-dimensional data.

UMAP
Dimensionality Reduction
HDBSCAN
3. Data, Science & AI

implementing-llms-litgpt

davila7

18.0K

Enables clean, single-file LLM training and fine-tuning with LitGPT, supporting LoRA/QLoRA for educational and production use.

LitGPT
LLMs
LoRA
3. Data, Science & AI

model-pruning

davila7

18.0K

Reduces LLM size and accelerates inference using pruning techniques like Wanda and SparseGPT, achieving 50% sparsity with minimal accuracy loss.

Pruning
Sparsity
LLM
3. Data, Science & AI
PreviousPage 5 of 417 PageNext