ThirdBrAIn.tech
Tag: Custom-Metrics
7 items with this tag.
May 06, 2025
AGNTCY & Galileo: Building and Monitoring a Weather Agent
LLM
agentic-evaluations
AI-development
AI-tools
autonomous-agents
AI-safety
Galileo
Agent-Evaluation
Agentic-Systems
RAG
Custom-Metrics
AI-Developers
Hallucinations-(AI)
GenAI-Evals
Guardrails-(AI)
observability
AGNTCY
Internet-of-Agents
AI-Agent
Agent-Protocol
Agent-Framework
Galileo-AI
AI-Observability
AI-Monitoring
Artificial-Intelligence
Python
API-Integration
Agent-Tutorial
How-To-Build-Agent
AI-Demo
Developer-Tutorial
Software-Development
May 06, 2025
AI Agent Evaluation Boosting Safety
galileo
agentic-AI
agents
agent-deployment
LLM
developers
evaluation
AI
artificial-intelligence
AI-agent
evaluating-AI-agents
AI-security
Galileo-AI
machine-learning
AI-agents
AI-Models
AI-Performance
Agent-Tooling
AI-Evaluation
AI-Monitoring
Custom-Metrics
Agent-Performance
Task-Completion
Agent-Advancement
Galileo-custom-metrics-for-AI-agents
Evaluating-complex-AI-agents
AI-safety
AI-Alignment
Agentic-Systems
LLM-Evaluation
Goal-Alignment
May 06, 2025
Build It, Evolve It: AI Metrics That Adapt to Your Code
galileo
agentic-AI
agent-deployment
LLM
developers
evaluation
AI
artificial-intelligence
Galileo-AI
machine-learning
AI-safety
Software-Development
Artificial-Intelligence
Software-Engineering
MongoDB
AI-Engineers
Code-Quality
Automated-Testing
Evaluation-Metrics
AI-Development
Custom-Metrics
AI-evaluation
AI-metrics
AI-Code-Evaluation
human-in-the-loop
Metric-Customization
Code-Metrics
AI-Automation
AI-Evaluation-Tools
Generative-AI-Evaluation
ai
genai
podcast
May 06, 2025
Evaluation Agents Exploring the Next Frontier of GenAI Evals
LLM
ai-agent
agentic-evaluations
galileo-ai
AI-development
AI-tools
autonomous-agents
AI-safety
Galileo
Critique-of-Value-(COV)
Critique-of-Explanation-(COE)
Binary-Preference-Signal-(BPS)
Self-Augmenting-Agents
Single-Token-Probability
LLM-as-Judge
Agent-Evaluation
Agentic-Systems
RAG-Evaluation
RAG
Custom-Metrics
AI-Developers
Hallucinations-(AI)
GenAI-Evals
Model-Evaluation
Chain-of-Thought
Cost-Limit
Guardrails-(AI)
Luna
ChainPoll
observability
May 06, 2025
How Will AI Agent Evaluation Evolve?
galileo
agentic-AI
agents
agent-deployment
LLM
developers
evaluation
AI
artificial-intelligence
AI-agent
evaluating-AI-agents
AI-security
Galileo-AI
machine-learning
AI-agents
AI-Models
AI-Performance
Agent-Tooling
AI-Evaluation
AI-Monitoring
Custom-Metrics
Agent-Performance
Agent-Advancement
Galileo-custom-metrics-for-AI-agents
Evaluating-complex-AI-agents
AI-safety
AI-Alignment
Agentic-Systems
LLM-Evaluation
Human-Feedback
Human-in-the-loop
May 06, 2025
Legacy Systems? AI's Impact on Modernization (Months, Not Years)
galileo
agentic-AI
agents
agent-deployment
LLM
developers
evaluation
AI
artificial-intelligence
AI-agent
Galileo-AI
machine-learning
Custom-Metrics
AI-safety
AI-Modernization
Legacy-Systems
Software-Development
Artificial-Intelligence
Benchmarking
Software-Engineering
Automation
AI-vs-Human
MongoDB
Legacy-Code-Modernization
Application-Modernization
System-Modernization
Technical-Debt
AI-Code-Generation
Enterprise-Modernization
Database-Modernization
May 06, 2025
MongoDB's AI Code Quality Fix: The Chain of Repair
galileo
agentic-AI
agents
agent-deployment
LLM
developers
evaluation
AI
artificial-intelligence
Galileo-AI
machine-learning
AI-safety
Software-Development
Artificial-Intelligence
Software-Engineering
Automation
MongoDB
AI-Engineers
AI-Code-Generation
Code-Quality
Software-Testing
Automated-Testing
Evaluation-Metrics
Code-Repair
AI-Development
Custom-Metrics
AI-evaluation
AI-metrics
Code-Optimization
AI-Automation
AI-code-quality
Developer-Tools