Gurman: Apple Working on On-Device LLM for Generative AI Features

Gurman: Apple Working on On-Device LLM for Generative AI Features

Apple is developing its own large language model (LLM) that runs on-device to prioritize speed and privacy, Bloomberg‘s Mark Gurman reports. Writing in his “Power On” newsletter, Gurman said that Apple’s LLM underpins upcoming generative AI features. “All indications” apparently suggests that it will run entirely on-device, rather than via the cloud like most existing … Read more

Startup claims to boost LLM performance using standard memory instead of GPU HBM — but experts remain unconvinced by the numbers despite promising CXL technology

Startup claims to boost LLM performance using standard memory instead of GPU HBM — but experts remain unconvinced by the numbers despite promising CXL technology

MemVerge, a provider of software designed to accelerate and optimize data-intensive applications, has partnered with Micron to boost the performance of LLMs using Compute Express Link (CXL) technology.  The company’s Memory Machine software uses CXL to reduce idle time in GPUs caused by memory loading. The technology was demonstrated at Micron’s booth at Nvidia GTC … Read more

Claude 3 API Opus LLM performance tested

Claude 3 API Opus LLM performance tested

Earlier this week Anthropic surprise the AI community by releasing three new AI models making up the Claude 3 family. The three different-sized models: Haiku, Sonnet, and Opus are vision language models (VLMs), capable of processing both text and images. If you’re interested in learning more about the performance of the Claude 3 API Opus … Read more

Using LangGraph to create multi-agent LLM coding AI frameworks

Using LangGraph to create multi-agent LLM coding AI frameworks

LangGraph has been used to create a multi-agent large language model (LLM) coding framework. This framework is designed to automate various software development tasks, including coding, testing, and debugging. The system is built upon the LangGraph module, which enhances the LangChain ecosystem by enabling the creation of AI agents. The framework features specialized agents, each … Read more

StarCoder2 LLM AI model designed for developers

StarCoder2 LLM AI model designed for developers

StarCoder2 is an advanced open-source coding language model designed for developers, is being made offering three variants with different parameter sizes: 3 billion, 7 billion, and 15 billion. It is the latest version of the Starcoder series and has been trained on a vast array of programming languages and tokens. The model is noted for … Read more

New Mistral Next prototype large language model (LLM)

New Mistral Next prototype large language model (LLM)

Mistral AI has released a new prototype large language model (LLM) named Mistral Next without much prior information or details. The model is currently available for testing on the Chatbot Arena platform. Users are encouraged to try it out and provide feedback. The model’s capabilities, training, and architecture remain undisclosed, but it has demonstrated impressive … Read more

MiniCPM 2B small yet powerful large language model (LLM)

MiniCPM 2B small yet powerful large language model (LLM)

In the rapidly evolving world of artificial intelligence, a new AI large language model (LLM) has been created in the form of the MiniCPM 2B, a compact AI LLM, offering a level of performance that rivals some of the biggest names in the field. With its 2 billion parameters, it stands as a formidable alternative … Read more

LLMWare unified framework for developing LLM apps with RAG

LLMWare unified framework for developing LLM apps with RAG

An innovative framework called LLMWare has been developed to provide users with a unified framework for developing projects and applications using large language models (LLMs) . This innovative tool is designed to help developers create applications that are powered by large language models. With its advanced retrieval augmented generation (RAG) capabilities, LLMWare is enhancing the … Read more

Build a custom AI large language model GPU server (LLM) to sell

Build a custom AI large language model GPU server (LLM) to sell

Deploying a custom language model (LLM) can be a complex task that requires careful planning and execution. For those looking to serve a broad user base, the infrastructure you choose is critical. This guide will walk you through the process of setting up a GPU server, selecting the right API software for text generation, and … Read more