Fine-Tuning Llama-3.1-8B for Function Calling using LoRALeveraging Unsloth for fine-tuning with Weights & Biases integration for monitoring and vLLM for model serving6d ago6d ago
Ten ways to Serve Large Language Models: A Comprehensive GuideDeploying large language models (LLMs) can be a challenging task, especially with the growing complexity of models and hardware…Oct 24Oct 24
Evaluating Multimodal Models with LLaVA-CriticRefining Multimodal Models with Enhanced Evaluation TechniquesOct 18Oct 18
Crawl4AI: Unleashing Efficient Web ScrapingIn today’s data-driven world, the ability to efficiently gather and process information is paramount for the success of artificial…Oct 18Oct 18
Meet Ministral 3B and 8B: Edge AI Game-ChangersMistral AI’s New Frontier in Edge AI and On-Device ComputingOct 17Oct 17
Multi-Modal RAG: A Practical GuideUsing vLLM to serve models for Multimodal Text Summarization, Table Processing, and Answer SynthesisSep 17Sep 17
vLLM: Efficient Serving with Scalable PerformanceA guide to serving multimodal models like LLaVA on a CPU with vLLMSep 14Sep 14
Deploy Seamlessly with Red Hat OpenShift Local for MacIn today’s fast-paced development environment, having a streamlined local setup for deploying and testing backend applications is…Jul 28Jul 28