3. Data, Science & AI
Found 9989 skills
astropy
davila7
Comprehensive Python library for astronomical data analysis, handling coordinate transformations, FITS files, and cosmological calculations.
faiss
davila7
Provides efficient similarity search and clustering for dense vectors, supporting billions of vectors with GPU acceleration and multiple index types for high-performance AI applications.
matchms
davila7
Processes mass spectrometry data (mzML/MGF/MSP) with spectral similarity calculations and metadata harmonization for metabolomics research.
scikit-survival
davila7
Comprehensive Python toolkit for survival analysis, including Cox models, Random Survival Forests, and evaluation metrics for time-to-event data.
senior-computer-vision
davila7
Provides advanced computer vision capabilities including object detection, segmentation, and model deployment using PyTorch, OpenCV, and vision transformers for production AI systems.
pytdc
davila7
Offers AI-ready drug discovery datasets including ADME, toxicity, and DTI with benchmarks and scaffold splits for therapeutic machine learning.
reactome-database
davila7
Queries Reactome REST API for pathway analysis, gene-pathway mapping, and molecular interaction studies in systems biology research.
unsloth
davila7
Expert guidance for fast fine-tuning of AI models using Unsloth, achieving 2-5x speedup and 50-80% memory reduction via LoRA/QLoRA optimization.
ray-data
davila7
Scalable data processing for ML workloads, supporting Parquet, CSV, JSON, and images. Integrates with Ray Train, PyTorch, TensorFlow for distributed ETL and preprocessing.
rdkit
davila7
Provides molecular data processing including SMILES parsing, descriptor calculation, and substructure search for cheminformatics applications.
zarr-python
davila7
A Python library for efficient storage and processing of large scientific datasets using chunked arrays, with cloud storage integration and compatibility with NumPy, Dask, and Xarray.
mamba-architecture
davila7
Provides a state-space model architecture (Mamba) for efficient sequence processing with O(n) complexity, 5ร faster inference, and million-token sequence support without KV cache.
dnanexus-integration
davila7
Enables genomics pipeline development and execution on DNAnexus cloud platform via dxpy SDK for data management and analysis of FASTQ/BAM/VCF formats.
plotly
davila7
Interactive Python library for creating scientific, statistical, and financial visualizations including charts, plots, and dashboards with customizable options.
get-available-resources
davila7
Detects and reports system resources (CPU, GPU, memory) to guide computational strategy for scientific tasks, including recommendations for parallel processing and GPU acceleration.
transformer-lens-interpretability
davila7
Guides mechanistic interpretability research using TransformerLens to inspect transformer internals, attention patterns, and activation patching experiments.
qutip
davila7
Provides quantum simulation and analysis for quantum systems, including states, operators, and dynamics, using QuTiP.
chembl-database
davila7
Query ChEMBL database for bioactive molecules, retrieve bioactivity data (IC50, Ki), and perform structure-activity relationship studies in medicinal chemistry.
outlines
davila7
Provides type-safe structured output generation for AI models using Pydantic, ensuring valid JSON/XML/code with local model support and optimized speed.
scanpy
davila7
Comprehensive single-cell RNA-seq analysis pipeline including data loading, QC, dimensionality reduction, clustering, and cell type annotation using Scanpy.
statsmodels
davila7
Statistical modeling toolkit providing OLS, GLM, logistic regression, ARIMA, and hypothesis tests for data analysis and econometric research.
deepchem
davila7
DeepChem enables molecular property prediction, GNN-based drug discovery modeling, and benchmarking via MoleculeNet for pharmaceutical applications.
pydeseq2
davila7
Conducts differential gene expression analysis on RNA-seq data with DESeq2, including statistical testing and visualization.
xlsx
davila7
Provides tools for creating, editing, and analyzing spreadsheet data with formulas, formatting, and visualization for .xlsx and .csv files.