Tag: llm

  • Developers, meet Codestral: Mistral AI debuts a new AI model aimed at transforming the coding experience

    Mistral AI, the rapidly emerging French artificial intelligence firm, has officially launched Codestral, a new and highly anticipated large language model (LLM) specifically designed for code generation and assistance. This release marks a significant step for Mistral AI as it enters the competitive arena of AI-powered developer tools, signaling a potential shift in how software…

  • Streamlining decision-making with LlamaIndex’s new Agent Document Workflow

    LlamaIndex has recently unveiled its innovative Agent Document Workflow (ADW) feature, marking a significant advancement in how organizations can streamline document processing and enhance decision-making capabilities. This new architecture goes beyond traditional retrieval-augmented generation (RAG) methods, introducing a more dynamic and integrated approach to handling documents.…

  • Explore ChatRTX: NVIDIA’s local AI RAG solution for RTX GPUs

    NVIDIA’s ChatRTX is an innovative demo application that brings personalized AI chat capabilities to Windows PCs equipped with RTX graphics cards. This local AI solution enables users to interact with their personal content—including documents, notes, and images—through a sophisticated chatbot powered by large language models (LLMs). At its core, ChatRTX leverages Retrieval-Augmented Generation (RAG), TensorRT-LLM,…

  • Transformer: The quiet revolution that changed Artificial Intelligence forever

    In the summer of 2017, a seemingly modest research paper titled “Attention Is All You Need” quietly emerged from Google Brain, fundamentally transforming the landscape of artificial intelligence. While it didn’t arrive with fanfare, this paper would become the foundation for virtually every major AI model we use today, from OpenAI’s ChatGPT to Meta’s Llama…

  • DeepSeek unveils V3 Language Model with remarkable efficiency

    DeepSeek has introduced its latest advancement in artificial intelligence, DeepSeek-V3, a language model that combines exceptional performance with remarkable efficiency. It employs a Mixture-of-Experts (MoE) architecture with 671 billion total parameters, of which only 37 billion are activated for each token. What sets DeepSeek-V3 apart is its unprecedented training efficiency. The…

  • OpenAI faces second major December outage: Make local AI processing more appealing

    OpenAI experienced another significant service disruption on Thursday, with ChatGPT, Sora, and its developer APIs going dark for over four hours, marking the second major outage this month. The incident, which began at 11 a.m. PT, affected millions of users worldwide and has reignited discussions about the reliability of cloud-based AI services. The company attributed…

  • UAE’s TII launches Falcon 3: A new generation of efficient Language Models

    The Technology Innovation Institute (TII), backed by the UAE government, has introduced Falcon 3, a significant advancement in Small Language Model (SLM) technology. This new family of open-source models represents a strategic move toward more accessible and efficient AI implementations. The Falcon 3 series comprises four model variants—1B, 3B, 7B, and 10B parameters—each available in…

  • Connecting data: Perplexity acquires Carbon to enhance AI search

    Perplexity AI has made a significant move in the tech landscape by acquiring Carbon, a Seattle-based startup specializing in data connectivity for large language models. This strategic acquisition aims to enhance Perplexity’s AI capabilities by integrating Carbon’s advanced retrieval engine, which connects external data sources to AI systems. Users can expect to link popular applications…

  • xAI expands Grok’s accessibility with new iOS app and web platform

    xAI, Elon Musk’s artificial intelligence venture, is broadening access to its Grok chatbot through a new iOS application, currently in beta testing across select countries including Australia. This expansion marks a significant shift: Grok was previously available only to users of X (formerly Twitter). The standalone application showcases comprehensive AI capabilities, incorporating real-time data access from…

  • Top open-source Language Models in 2024

    The landscape of open-source Language Models continues to evolve, with various models offering unique capabilities. Here’s a comprehensive overview of the top models currently shaping the industry in 2024. LLaMA 3 (8B-70B parameters): Meta’s upgraded model offers two variants, 8B and 70B parameters. The 70B version demonstrates exceptional efficiency in language modeling and question-answering tasks.…