Loading...
Loading...
Found 15241 skills
mberto10
Manages Langfuse prompts with version control, deployment labels, and comparison for LLM application development.
mberto10
Builds foundational evaluation infrastructure for AI/ML models, including datasets, graders, harness, and baselines required before optimization.
mberto10
Configures agent evaluation pipelines including flow discovery, quality dimensions, dataset creation, and judge prompts for AI agent testing.
mberto10
Diagnoses root causes in LLM application workflows by combining Langfuse trace data with codebase investigation to resolve failures and improve output quality.
mberto10
Runs Langfuse experiments to evaluate LLM performance, compare model and prompt variations, and analyze failures.
mberto10
Executes and analyzes LLM evaluation experiments using Langfuse, including prompt testing, dataset evaluation, and LLM-as-judge comparisons.
mberto10
Analyzes score trends, regressions, and quality metrics distributions over time for performance evaluation and improvement.
mberto10
Surgically retrieves Langfuse observability data for debugging and analyzing LLM application traces with multiple output modes.
mberto10
Defines goal, constraints, and adjustable parameters for optimization problems in AI and data science frameworks.
mberto10
Enables session-level analysis in Langfuse to inspect, debug multi-turn conversations, identify errors, and evaluate performance metrics.
mberto10
Generates ASCII-based data visualizations including charts, graphs, and progress bars for terminal display in Claude Code responses.
mberto10
Configures Langfuse tracing, observability, and scoring for Python-based LLM pipelines to monitor and debug AI model interactions.
guicheffer
Enables cross-platform event tracking for user interactions and feature usage with analytics providers like Firebase and Adjust.
mberto10
Manages Langfuse datasets for LLM experiment validation and regression testing, including dataset creation, trace curation, and test set building.
xiaxianlin
Generates audio files from text segments using MiniMax TTS API with built-in error handling and retry logic.
mberto10
Manages Langfuse datasets for LLM applications, enabling creation, curation of regression/golden sets, and dataset item inspection.
sskim91
Initializes Serena MCP for AI-enhanced semantic analysis of code, enabling deeper code structure and meaning interpretation during analysis.
mberto10
Provides strategic guidance for evaluating, improving AI agents, selecting metrics, building datasets, and setting up iteration loops with Langfuse.
sskim91
Optimizes database performance through advanced SQL query tuning, indexing strategies, and EXPLAIN plan analysis.
mberto10
Retrieves and analyzes Langfuse traces, runs, and metadata for debugging and optimizing LLM application workflows.
mberto10
Analyzes LLM model quality metrics including score trends, regressions, and distributions across releases and environments in Langfuse.
simplerick0
Specializes in relational database design, schema modeling, normalization, indexing, and migration planning for optimal data integrity and performance.
mberto10
Provides a hypothesis-driven methodology for systematically improving AI agent performance through iterative optimization cycles.
simplerick0
Configures and manages Cursor IDE agents for parallel AI-assisted coding, supporting background, cloud, and multi-agent development workflows.