ThirdBrAIn.tech
Search
Search
Dark mode
Light mode
Explorer
Tag: agentic-evaluations
4 items with this tag.
Apr 25, 2025
The Future of AI Agents How Standards and Evaluation Drive Innovation
LLM
agentic-evaluations
AI-development
Agent-Evaluation
RAG
AGNTCY
Internet-of-Agents
AI-Agent
Agent-Protocol
Galileo-AI
AI-Observability
Artificial-Intelligence
How-To-Build-Agent
Software-Development
Agent-Interoperability
AI-Standards
Agent-Standards
Open-Source-AI
AI-Evaluation
Agent-to-Agent-Communication
Multi-Agent-Systems
A2A
MCP
ACP
Python
Cisco
LangChain
AI-Security
Agentic-AI
AI-Development
API-Integration
webinar
live-demo
Tool-Calling
YT/2025/M04
YT/2025/W17
Apr 11, 2025
AGNTCY & Galileo Building and Monitoring a Weather Agent
LLM
agentic-evaluations
AI-development
AI-tools
autonomous-agents
AI-safety
Galileo
Agent-Evaluation
Agentic-Systems
RAG
Custom-Metrics
AI-Developers
Hallucinations-(AI)
GenAI-Evals
Guardrails-(AI)
observability
AGNTCY
Internet-of-Agents
AI-Agent
Agent-Protocol
Agent-Framework
Galileo-AI
AI-Observability
AI-Monitoring
Artificial-Intelligence
Python
API-Integration
Agent-Tutorial
How-To-Build-Agent
AI-Demo
Developer-Tutorial
Software-Development
YT/2025/M04
YT/2025/W15
Apr 04, 2025
Evaluation Agents Exploring the Next Frontier of GenAI Evals
LLM
ai-agent
agentic-evaluations
galileo-ai
AI-development
AI-tools
autonomous-agents
AI-safety
Galileo
Critique-of-Value-(COV)
Critique-of-Explanation-(COE)
Binary-Preference-Signal-(BPS)
Self-Augmenting-Agents
Single-Token-Probability
LLM-as-Judge
Agent-Evaluation
Agentic-Systems
RAG-Evaluation
RAG
Custom-Metrics
AI-Developers
Hallucinations-(AI)
GenAI-Evals
Model-Evaluation
Chain-of-Thought
Cost-Limit
Guardrails-(AI)
Luna
ChainPoll
observability
YT/2025/M04
YT/2025/W14
Jan 22, 2025
How to Evaluate Agents Galileo’s Agentic Evaluations in Action
LLM
ai-agent
agentic-evaluations
ai-evaluation
galileo-ai
ai-agent-evaluation
LLM-evaluation
metrics
tool-errors
gen-ai-evaluations
Luna-evaluation-suite
failure-points
workflows
LLM-workflows
AI-development
AI-tools
agent-frameworks
agent-architectures
autonomous-agents
RAG-systems
Galileo-platform
AI-metrics
model-evaluation
AI-performance
AI-safety
cost-optimization
Galileo
nondeterministic
Galileo-Luna
latency-reduction
responsible-AI
YT/2025/M01
YT/2025/W04