3. Data, Science & AI

24 skills

Found 9989 skills

Total Stars: 6.7M
Avg Stars: 667

astropy

davila7

18.0K

Comprehensive Python library for astronomical data analysis, handling coordinate transformations, FITS files, and cosmological calculations.

FITS
WCS
Cosmology
3. Data, Science & AI

faiss

davila7

18.0K

Provides efficient similarity search and clustering for dense vectors, supporting billions of vectors with GPU acceleration and multiple index types for high-performance AI applications.

Vector Search
GPU Acceleration
k-NN
3. Data, Science & AI

matchms

davila7

18.0K

Processes mass spectrometry data (mzML/MGF/MSP) with spectral similarity calculations and metadata harmonization for metabolomics research.

mzML
spectral similarity
metabolomics
3. Data, Science & AI

scikit-survival

davila7

18.0K

Comprehensive Python toolkit for survival analysis, including Cox models, Random Survival Forests, and evaluation metrics for time-to-event data.

Survival Analysis
Cox Models
Random Survival Forests
3. Data, Science & AI

senior-computer-vision

davila7

18.0K

Provides advanced computer vision capabilities including object detection, segmentation, and model deployment using PyTorch, OpenCV, and vision transformers for production AI systems.

PyTorch
YOLO
Vision Transformers
3. Data, Science & AI

pytdc

davila7

18.0K

Offers AI-ready drug discovery datasets including ADME, toxicity, and DTI with benchmarks and scaffold splits for therapeutic machine learning.

ADME
DTI
Scaffold Splits
3. Data, Science & AI

reactome-database

davila7

18.0K

Queries the Reactome REST API for pathway analysis, gene-pathway mapping, and molecular interaction studies in systems biology research.

Reactome
REST API
Pathway Analysis
3. Data, Science & AI

unsloth

davila7

18.0K

Expert guidance for fast fine-tuning of AI models using Unsloth, achieving 2-5x speedup and 50-80% memory reduction via LoRA/QLoRA optimization.

Unsloth
LoRA
QLoRA
3. Data, Science & AI

ray-data

davila7

18.0K

Scalable data processing for ML workloads, supporting Parquet, CSV, JSON, and images. Integrates with Ray Train, PyTorch, and TensorFlow for distributed ETL and preprocessing.

Ray
Distributed
Multi-modal
3. Data, Science & AI

rdkit

davila7

18.0K

Provides molecular data processing including SMILES parsing, descriptor calculation, and substructure search for cheminformatics applications.

SMILES
Substructure Search
Molecular Fingerprints
3. Data, Science & AI

zarr-python

davila7

18.0K

A Python library for efficient storage and processing of large scientific datasets using chunked arrays, with cloud storage integration and compatibility with NumPy, Dask, and Xarray.

Zarr
NumPy
Dask
3. Data, Science & AI

mamba-architecture

davila7

18.0K

Provides a state-space model architecture (Mamba) for efficient sequence processing with O(n) complexity, 5× faster inference, and million-token sequence support without KV cache.

Selective SSM
Mamba
O(n) complexity
3. Data, Science & AI

dnanexus-integration

davila7

18.0K

Enables genomics pipeline development and execution on the DNAnexus cloud platform via the dxpy SDK for data management and analysis of FASTQ/BAM/VCF formats.

DNAnexus
dxpy
Genomics
3. Data, Science & AI

plotly

davila7

18.0K

Interactive Python library for creating scientific, statistical, and financial visualizations including charts, plots, and dashboards with customizable options.

Plotly
Data Visualization
Interactive Charts
3. Data, Science & AI

get-available-resources

davila7

18.0K

Detects and reports system resources (CPU, GPU, memory) to guide computational strategy for scientific tasks, including recommendations for parallel processing and GPU acceleration.

Resource Profiling
Scientific Computing
GPU Acceleration
3. Data, Science & AI

transformer-lens-interpretability

davila7

18.0K

Guides mechanistic interpretability research using TransformerLens to inspect transformer internals, attention patterns, and activation patching experiments.

TransformerLens
Mechanistic interpretability
Activation patching
3. Data, Science & AI

qutip

davila7

18.0K

Provides quantum simulation and analysis for quantum systems, including states, operators, and dynamics, using QuTiP.

QuTiP
Quantum dynamics
Open quantum systems
3. Data, Science & AI

chembl-database

davila7

18.0K

Queries the ChEMBL database for bioactive molecules, retrieves bioactivity data (IC50, Ki), and supports structure-activity relationship studies in medicinal chemistry.

ChEMBL
Bioactivity
SAR
3. Data, Science & AI

outlines

davila7

18.0K

Provides type-safe structured output generation for AI models using Pydantic, ensuring valid JSON/XML/code with local model support and optimized speed.

Pydantic
Structured Generation
vLLM
3. Data, Science & AI

scanpy

davila7

18.0K

Comprehensive single-cell RNA-seq analysis pipeline including data loading, QC, dimensionality reduction, clustering, and cell type annotation using Scanpy.

Scanpy
scRNA-seq
AnnData
3. Data, Science & AI

statsmodels

davila7

18.0K

Statistical modeling toolkit providing OLS, GLM, logistic regression, ARIMA, and hypothesis tests for data analysis and econometric research.

OLS
ARIMA
Time Series
3. Data, Science & AI

deepchem

davila7

18.0K

DeepChem enables molecular property prediction, GNN-based drug discovery modeling, and benchmarking via MoleculeNet for pharmaceutical applications.

GNN
MoleculeNet
ADMET
3. Data, Science & AI

pydeseq2

davila7

18.0K

Conducts differential gene expression analysis on RNA-seq data with DESeq2, including statistical testing and visualization.

DESeq2
RNA-seq
Differential Expression
3. Data, Science & AI

xlsx

davila7

18.0K

Provides tools for creating, editing, and analyzing spreadsheet data with formulas, formatting, and visualization for .xlsx and .csv files.

XLSX
CSV
Data Analysis
3. Data, Science & AI