ThirdBrAIn.tech

Tag: galileo-ai

2 items with this tag.

  • Apr 04, 2025

    https://i.ytimg.com/vi/PFlnCEqctDo/hqdefault.jpg

    Evaluation Agents Exploring the Next Frontier of GenAI Evals

    • LLM
    • ai-agent
    • agentic-evaluations
    • galileo-ai
    • AI-development
    • AI-tools
    • autonomous-agents
    • AI-safety
    • Galileo
    • Critique-of-Value-(COV)
    • Critique-of-Explanation-(COE)
    • Binary-Preference-Signal-(BPS)
    • Self-Augmenting-Agents
    • Single-Token-Probability
    • LLM-as-Judge
    • Agent-Evaluation
    • Agentic-Systems
    • RAG-Evaluation
    • RAG
    • Custom-Metrics
    • AI-Developers
    • Hallucinations-(AI)
    • GenAI-Evals
    • Model-Evaluation
    • Chain-of-Thought
    • Cost-Limit
    • Guardrails-(AI)
    • Luna
    • ChainPoll
    • observability
    • YT/2025/M04
    • YT/2025/W14
  • Jan 22, 2025

    https://i.ytimg.com/vi/QvStk5G8BZw/hqdefault.jpg

    How to Evaluate Agents Galileo’s Agentic Evaluations in Action

    • LLM
    • ai-agent
    • agentic-evaluations
    • ai-evaluation
    • galileo-ai
    • ai-agent-evaluation
    • LLM-evaluation
    • metrics
    • tool-errors
    • gen-ai-evaluations
    • Luna-evaluation-suite
    • failure-points
    • workflows
    • LLM-workflows
    • AI-development
    • AI-tools
    • agent-frameworks
    • agent-architectures
    • autonomous-agents
    • RAG-systems
    • Galileo-platform
    • AI-metrics
    • model-evaluation
    • AI-performance
    • AI-safety
    • cost-optimization
    • Galileo
    • nondeterministic
    • Galileo-Luna
    • latency-reduction
    • responsible-AI
    • YT/2025/M01
    • YT/2025/W04

Created with Quartz v4.5.0 © 2025 for

  • GitHub
  • Discord Community
  • Obsidian