29 items under this folder.

Qwen 3 Embeddings & Rerankers
Sam Witteveen

Qwen 3 Embeddings & Rerankers

Qwen-3-RerankerQwen-3-EmbeddingText-EmbeddingsRerankerQwen3AIArtificial-IntelligenceNLPNatural-Language-ProcessingMachine-LearningLarge-Language-ModelsLLMHugging-FaceSemantic-SearchInformation-RetrievalVector-SearchOpen-Source-AIAI-ModelsTransformer-ModelsDocument-EmbeddingsSentence-EmbeddingsSearch-RelevanceQwen-AIAI-TutorialNLP-TutorialText-RepresentationEmbedding-ModelsMistralOpenAIGeminiMTEBYT/2025/M06YT/2025/W23

Building with Chatterbox TTS, Voice Cloning & Watermarking
Sam Witteveen

Building with Chatterbox TTS, Voice Cloning & Watermarking

Chatterbox-TTSDia-TTSGemini-TTSvoice-cloningdeepfake-voicefake-voiceKokoro-TTSResemble-AItext-to-speechTTSsynthetic-voicevoice-generationopen-source-TTSspeech-synthesisnatural-language-processingmachine-learningartificial-intelligencevoice-technologyaudio-deepfakevoice-replicationTTS-modelexpressive-speechreal-time-TTSvoice-conversionzero-shot-voice-cloningaudio-generationTTS-softwarevoice-assistant-technologyYT/2025/M06YT/2025/W23

MedGemma - An Open Doctor Model?
Sam Witteveen

MedGemma - An Open Doctor Model?

MedGemmaGemmaGoogle-GemmaLarge-Language-ModelMedical-AIAI-in-MedicineHealthcare-AIAI-in-HealthcareGenerative-AIOpen-ModelGoogle-AIDeepMindHealth-TechClinical-AIMedical-ResearchNLPNatural-Language-ProcessingFoundation-ModelAI-for-DoctorsAI-for-Healthcare-ProfessionalsFuture-of-MedicineMedical-LLMMed-PaLMMed-PaLM-2Biomedical-AIHealth-AIMachine-Learning-in-MedicineGoogle-I/O-2025Gemma-3YT/2025/M06YT/2025/W23MLMAIHealthcareAIMedicalAI

Mistral Agents API - The NEW Agent System
Sam Witteveen

Mistral Agents API - The NEW Agent System

Mistral-AIMistral-AgentsMistral-Agents-APIMistral-LLMMixtralLe-ChatAI-AgentsAutonomous-AgentsIntelligent-AgentsAgent-FrameworkLLM-AgentsGenerative-AIArtificial-IntelligenceMachine-LearningLarge-Language-ModelsAI-APIAgent-SDKMulti-Agent-SystemsAI-AutomationTask-AutomationOpenAIOpenAI-AgentsGoogle-ADKVertex-AIMicrosoft-AISemantic-KernelSoftware-DevelopmentNatural-Language-ProcessingAI-ToolsWorkflow-AutomationYT/2025/M05YT/2025/W22

Gemini TTS - Native Audio Out
Sam Witteveen

Gemini TTS - Native Audio Out

Google-Gemini-2.5Gemini-2.5Gemini-AIText-to-SpeechTTSNative-AudioGemini-2.5-TTSGemini-2.5-Native-AudioSpeech-SynthesisAI-VoiceRealistic-AI-VoiceNatural-Sounding-TTSHigh-Quality-TTSVoice-GenerationAI-Voice-GeneratorGoogle-Gemini-FeaturesGemini-2.5-DemoGemini-2.5-ReviewAI-ToolsText-to-VoiceGoogle-Cloud-AIArtificial-IntelligenceMachine-LearningNext-Gen-TTSGemini-TTS-UpdateGoogle-Gemini-VoiceDia-TTSKokoro-TTSNotebookLMYT/2025/M05YT/2025/W22

Google I/O 25 - Models vs Products
Sam Witteveen

Google I/O 25 - Models vs Products

Google-I/O-2025GoogleIO2025Google-KeynoteAndroid-16Pixel-9aPixel-Fold-2Pixel-Watch-3Pixel-Buds-Pro-3Tensor-G5Google-AIGeminiGemini-APIGenerative-AIGoogle-AssistantWear-OSGoogle-TVChromeOSDeveloper-ToolsGoogle-CloudFirebaseFlutterAI-in-SearchGoogle-Workspace-AINew-Google-ProductsGoogle-AnnouncementsTech-NewsFuture-of-TechAndroid-16-FeaturesNew-Nest-HubGoogle-ARProject-Astra-UpdateYT/2025/M05YT/2025/W21

NVIDIA beats Whisper with Parakeetv2
Sam Witteveen

NVIDIA beats Whisper with Parakeetv2

NVIDIA-Parakeet-V2Parakeet-ASRASRLLMSpeech-to-TextNVIDIA-AINVIDIA-NeMoMachine-LearningFastConformerTDT-DecoderSpeech-RecognitionHigh-Accuracy-ASRTimestamp-FormattingPunctuation-RestorationSong-to-Lyrics-TranscriptionLong-Form-Audio-TranscriptionBackground-Noise-RobustnessOpen-Source-ASRHugging-FaceVoice-ApplicationsTranscription-ServicesSubtitle-GenerationNemo-Toolkithow-to-use-NVIDIA-Parakeet-for-transcriptionYT/2025/M05YT/2025/W20

Slash Your Gemini Bill Up To 75 %
Sam Witteveen

Slash Your Gemini Bill Up To 75 %

Cachingprompt-cachingdiscountgemini-2.5-proGeminiGooglecontext-cachinggemini-context-cachingcachereduce-costsperformance-boostImplicit-CachingGoogle-AILarge-Language-ModelLLMArtificial-IntelligenceAIToken-SavingAPI-EfficiencyAI-Cost-ReductionGemini-2.5-ProGemini-2.5-FlashGPT-4oChatGPTClaude-3Claude-3.5-SonnetLLM-ComparisonGoogle-Cloud-AILLM-CachingYT/2025/M05YT/2025/W20

The Improved Gemini 2.5 Pro - A Coding Powerhouse
Sam Witteveen

The Improved Gemini 2.5 Pro - A Coding Powerhouse

gemini-2.5-progemini-2.5transcriptiongemini2.5gemini-2.5-updateAI-Studiogemini-2.5-pro-audiogemini-2.5-pro-codingartificial-intelligencelarge-language-modeldeep-learningGPT-4chatgptclaudegemini-coderwindsurfcursorMCPA2AADKAgent-Development-KitYT/2025/M05YT/2025/W19

Microsoft Joins the Reasoning Race!!
Sam Witteveen

Microsoft Joins the Reasoning Race!!

Phi-4Phi-4-MiniPhi-4-ReasoningPhi-4-Mini-ReasoningMicrosoft-Phi-4AI-ReasoningMathematical-Reasoning-AILogical-Reasoning-AISmall-Language-ModelSLM-ReasoningTransformer-ModelAI-Problem-SolvingPhi-4-Mini-InstructPhi-4-MultimodalMicrosoft-AI-ModelsDeepSeekDeepSeek-R1-Distill-Llama-8BLlamaQwenMistralGemmaLarge-Language-ModelsNatural-Language-ProcessingNLPAI-BenchmarksGSM8kMATH-BenchmarkAI-ResearchOllamaOpen-Source-AIYT/2025/M05YT/2025/W18

Introducing the Qwen 3 Family
Sam Witteveen

Introducing the Qwen 3 Family

Qwen3Qwen-3Qwen3-LLMQwen-3-ModelQwen3-AIQwen3-ReleaseQwen-3rd-GenQwen3-0.6BQwen3-4BQwen3-8BQwen3-14BQwen3-32BQwen3-30B-A3BQwen3-235B-A22BQwen3-MoEQwen3-DenseAlibaba-CloudAlibaba-AIQwen-TeamAlibaba-GroupHybrid-ReasoningThinking-Mode-AIAI-Reasoning-Modes119-Languages36T-TokensOpen-Source-AIApache-2.0-LicenseLarge-Language-ModelLLMMixture-of-ExpertsDense-LLM128k-ContextHugging-FaceModelScopeo1Best-Open-Source-LLMYT/2025/M04YT/2025/W18

Dia 1.6B TTS for NotebookLM Podcasts
Sam Witteveen

Dia 1.6B TTS for NotebookLM Podcasts

text-to-speechTTSpodcastvoice-cloningmachine-learningopen-sourceAI-modelsaudio-synthesisneural-networksNotebookLMYT/2025/M04YT/2025/W17

GPT-4.1 - The Catchup Models
Sam Witteveen

GPT-4.1 - The Catchup Models

OpenAILLMAIartificial-intelligencemachine-learningGPTChatGPTGPT-4.1GPT-4GPT-4o4o-minio3o3-miniNanoMiniAnthropicClaudeSonnet-3.53.63.7GeminiFlashFlash-liteGemini-2.0-ProGemini-2.5-Protool-callsinstruction-followingLLM-reasoningCoTchain-of-thoughtfunction-callingSWE-BenchWindsurfADAlong-contextlatencyopenai-latest-modelopen-sourcegpt-4.1-minigpt-4.1-nanoYT/2025/M04YT/2025/W16

Google's NEW Agent2Agent Protocol
Sam Witteveen

Google's NEW Agent2Agent Protocol

GoogleAgent2Agent-Protocolartificial-intelligencedecentralized-communicationdeveloper-toolscloud-technologycollaborationAI-developmentGoogle-Cloud-Next-2025software-standardsYT/2025/M04YT/2025/W15

Google Launches an Agent SDK - Agent Development Kit
Sam Witteveen

Google Launches an Agent SDK - Agent Development Kit

google-cloud-next-2025agent-developer-kitADKagent2agent-protocolgeminiartificial-intelligenceironwoodTPULyriavertex-AIAgentspaceAI-Agentsopenaiagent-SDKclaudesonnetlangchainllamaindexMCPcrewaiagent-frameworkYT/2025/M04YT/2025/W15

Gemini 2.5 Pro for YouTube Analysis
Sam Witteveen

Gemini 2.5 Pro for YouTube Analysis

gemini-2.5videotranscriptiondiarizationgooglegemini2.5proexperimentalgemini-2.5-experimentalOCRAI-Studioopenaillama-4gemini-2.5-pro-videogemini-2.5-pro-codinggemini-2.5-pro-how-to-usetutorialgemini-2.5-pro-featurescode-transcriptionvideo-transcriptionprompt-with-a-video-and-textgemini-2.5-videoYT/2025/M04YT/2025/W15

Gemini 2.5 Pro for Audio Transcription
Sam Witteveen

Gemini 2.5 Pro for Audio Transcription

gemini-2.5Audiotranscriptiondiarizationgooglegemini2.5proexperimentalgemini-2.5-experimentalwhisperAI-Studiooutput-tokensopenaillama-4gemini-2.5-pro-audiogemini-2.5-pro-codinggemini-2.5-pro-how-to-usetutorialcolabgemini-2.5-pro-featuresartificial-intelligenceAIllmlarge-language-modeldeep-learningGPT-4chatgptclaudeYT/2025/M04YT/2025/W14

OpenAI Needs YOU!!
Sam Witteveen

OpenAI Needs YOU!!

openaigpt-2o3-minisalesforceCTRLqwenqwen2.5-omni4omulti-modalmultimodalsoraimage-generationomniopen-source1.5B400B70Bopen-weight-LLMLLMAIartificial-intelligenceaillmmistralchatgptai-newsclaudeanthropicapple-aiapple-intelligencellamameta-aigoogle-aitiktokmultimodal-llmYT/2025/M04YT/2025/W14

Creating Mind Maps with OpenAI's Image Generation
Sam Witteveen

Creating Mind Maps with OpenAI's Image Generation

mind-mapsimage-generationAI-instruction-followingOpenAIprompt-engineeringcreative-visualizationAI-toolsvisual-thinkingneural-networkscontent-creationYT/2025/M03YT/2025/W13

Qwen 2.5 Omni - Your NEW Open Omni Powerhouse
Sam Witteveen

Qwen 2.5 Omni - Your NEW Open Omni Powerhouse

qwenqwen2.52.5omnivlmsllmsreasoningmoshigemini-pro-1.5audiovideotextmultimodalopenaiapache-2.0open-sourceopen-weightsgooglenew-llmnew-Qwen-modelAIartificial-intelligencelarge-language-modelclaudeanthropicqwen-2.5-omniqwen-2.5onew-qwen-2.5-omni-modelspeech-to-speech-ai-modelYT/2025/M03YT/2025/W13

Gemini 2.5 - The Thinking Family of Models
Sam Witteveen

Gemini 2.5 - The Thinking Family of Models

LLMsgeminigoogle2.5proDeepSeekGemini-2.5-Progoogle-gemini-2.5-pronew-gemini-2.5-pronew-ai-gemini-2.5-pronew-google-gemini-2.5-pro-ai-modelnew-ai-coder-gemini-2.5-prodeepseek-v3.1deepseek-v3Gemini-2.5Google-AIGPT-4.5-vs-GeminiDeepSeek-R1best-AI-modelAI-chatbotGPT-5LLaMA-3AI-codingreasoning-AIo3-miniclaude-3.7-sonnetgrok-3-betagemini-2.0-progemini-aigoogle-aiYT/2025/M03YT/2025/W13

NVIDIA's New Reasoning Models
Sam Witteveen

NVIDIA's New Reasoning Models

nvidiagtc-2025nemotronllama-3.1llama-3.349B9BQwenDeepSeekR1V1agentic-aiAIartificial-intelligencelarge-language-modelllmnemotron-model-familiesjensen-huangroboticsaccelerated-computingmeta-AIllama-nemotronnvidia-agentsYT/2025/M03YT/2025/W12

SmolDocling - The SmolOCR Solution?
Sam Witteveen

SmolDocling - The SmolOCR Solution?

olmocrmistral-ocrgemini-ocropenai-ocrsmoldoclingvlmsmolagentssmol-vlmssmol-models1B256Mvlm-ocropen-sourcellmartificial-intelligencelarge-language-modelqwen-2-VLOCRoptical-character-recognitionpdf-ocrdocument-ocrIBMocr-imagesocr-pdfocr-docxdoclingdocling-ocrSmolVLMSmolOCRYT/2025/M03YT/2025/W12

How to Build an Agent with the OpenAI Agents SDK
Sam Witteveen

How to Build an Agent with the OpenAI Agents SDK

openaiaiopenai-agentai-agentopenai-customer-agentopen-aiopen-ai-agentchatgptchat-gpt-ai-agentsopenai-agentsIn-N-Out-chatbotfastfood-AI-Agentsllmartificial-intelligencelarge-language-modelclaudeanthropicapple-intelligencellamahow-to-build-fastfood-AI-Agentsmulti-agent-AI-systemAI-ChatbotAI-Frameworkchatbot-DevelopmentAI-orchestrationYT/2025/M03YT/2025/W12

OpenAI - NEW API & Agent Tools Breakdown
Sam Witteveen

OpenAI - NEW API & Agent Tools Breakdown

OpenAIchatgptchat-gptChatGPTGPT-4GPTchatgpt-agentsopenai-agentsopenai-assistantai-agentsai-agentai-assistantchatgpt-apigpt-4gpt-4ogpt-4o-miniobservbabilitycomputer-useagent-toolsagents-sdknew-toolsopenai-new-toolsoperatorchatgpt-tasksGeminiAnthropicgpt-3-apiresponse-APIcompletions-APIweb-search-toolfile-searchmetadata-filteringCUA-modelresponses-APIOpenAI-responses-APIprompts-playgroundchat-playgroundYT/2025/M03YT/2025/W11

Gemma 3 - The NEW Gemma Family Members Have Arrived!!!
Sam Witteveen

Gemma 3 - The NEW Gemma Family Members Have Arrived!!!

gemma-3google-gemma1B4B12B27GGemma-2multilingualKV-cache128K-tokensinstruction-finetuneddata-distillationSigLIPknowledge-distillationRoPEtransformervision-encoderreinforcement-learningLMSys-Chatbot-ArenaElo-scoresMMLUlivecodebenchmathGemini-2.0Gemini-1.5-ProQwen2.5-70Bdeepmindgemma-3-4bgemma-3-12bgemma-3-27bYT/2025/M03YT/2025/W11

Mistral OCR - Multimodal & Multilingual OCR
Sam Witteveen

Mistral OCR - Multimodal & Multilingual OCR

mistralmistral-aimistral-ocrOCRPDF-OCRimage-OCRmistral-APIJSONLlamaIndexLangChainGemini-2Gemini-2.0-Flashmistralocrpixtralandrew-ngocr-llmocr-aiagentic-document-extractionagentic-ocrYT/2025/M03YT/2025/W10

Multi-Agent AI EXPLAINED How Magentic-One Works
Sam Witteveen

Multi-Agent AI EXPLAINED How Magentic-One Works

microsoft-aimagnetic-onemicrosoft-magnetic-one-aimulti-agent-aiai-systemgeneral-aidigital-assistantorchestrator-aiai-researchmicrosoft-autogenai-automationmicrosoft-autogenbenchai-toolscoding-aiweb-browsing-aiai-productivitymicrosoft-newsmicrosoft-ai-releasemodular-aiwebsurferfilesurfercodercomputer-terminalGPT-4oLangGraphCrewAIagent-frameworksmultimodalgeneralist-agentYT/2024/M11YT/2024/W46

AgentWrite with LangGraph
Sam Witteveen

AgentWrite with LangGraph

langchainlangsmithlanggraphmulti-agentOpenAInodesLLMlarge-language-modelsgenerative-aigen-aiRAGAI-chatbotchatbotspythonopenaitechcodingmachine-learningMLNLPchatgptgeminigooglemeta-aillama-indexvector-databaseLangGraphLangchainGPT4osearch-chatbotsearch-systemsearch-appRAG-searchperplexity-vs-googlellama-3.1-70Bgemini-flash-experimentalgemini-flashagentwritelongwriter10000-words-outYT/2024/M09YT/2024/W36