Categories
Life Style

La IA DeepSeek de China puede ser más inteligente que la IA más inteligente de OpenAI

[ad_1]

hay algo nuevo Amnistía Internacional Hay un jugador en la ciudad y es posible que quieras prestarle atención.

El lunes, la empresa china de inteligencia artificial profundamente enfermo DeepSeek R1 ha lanzado un nuevo modelo de código abierto para un lenguaje a gran escala.

Según DeepSeek, R1 supera a otros LLM (modelos de lenguajes grandes) populares, por ejemplo Abierto AI en varios Criterios importantesy el es Especialmente bueno Con tareas matemáticas, de codificación y de pensamiento.

DeepSeek R1 es en realidad una mejora de DeepSeek R1 Zero, un LLM formado sin un método utilizado tradicionalmente llamado ajuste fino supervisado. Esto lo hacía muy capaz de realizar ciertas tareas, pero como dijo el propio DeepSeek, el Zero era “legible y ciego al lenguaje”. Ingrese a R1, que soluciona estos problemas incorporando “entrenamiento en múltiples etapas y datos de arranque en frío” antes de entrenarlo con aprendizaje por refuerzo.

Velocidad de la luz triturable

Dejando a un lado el lenguaje técnico vago (los detalles son… conectado Si está interesado), hay varias cosas clave que debe saber sobre DeepSeek R1. En primer lugar, es de código abierto, lo que significa que está examinado por expertos, lo que debería aliviar las preocupaciones sobre la privacidad y la seguridad. En segundo lugar, su uso como aplicación web es gratuito, mientras que el acceso API está disponible. muy barato ($0,14 por millón de tokens de entrada, en comparación con Abierto AI $7,5 por su modelo de razonamiento más poderoso, o1).

Lo más importante es que esta cosa es muy, muy capaz. Para probarlo, lo lancé inmediatamente a aguas profundas y le pedí que codificara una aplicación web bastante compleja que necesitaba analizar datos disponibles públicamente y crear un sitio web dinámico que contuviera información meteorológica y de viajes para turistas. Sorprendentemente, DeepSeek produjo un código HTML bastante aceptable de inmediato y pudo mejorar aún más el sitio basándose en mis comentarios mientras refinaba y optimizaba el código por sí solo a lo largo del camino.

Inteligencia artificial de búsqueda profunda

Lo haré todo… mañana.
Crédito: Stan Schroeder/Mashable/DeepSeek

También le pedí que mejorara mis habilidades de ajedrez en cinco minutos, y respondió con una serie de consejos muy útiles y cuidadosamente seleccionados (mis habilidades de ajedrez mejoraron, pero sólo porque era demasiado vago para seguir las sugerencias de DeepSeek). .

Luego le pedí a DeepSeek que demostrara lo inteligente que es en exactamente tres oraciones. Mal movimiento de mi parte, porque yo, humano, no soy lo suficientemente inteligente como para verificar o incluso comprender completamente ninguna de las tres frases. Tenga en cuenta que en la captura de pantalla siguiente puede ver el “proceso de pensamiento” de DeepSeek mientras descubre la respuesta, que probablemente sea incluso más fascinante que la respuesta misma.

Inteligencia artificial de búsqueda profunda

Lo entendemos, eres inteligente.
Crédito: Stan Schroeder/Mashable/DeepSeek

Es impresionante de usar. Pero como ZDnet tomó notaDetrás de todo esto hay costos de capacitación que son un orden de magnitud más bajos que los de algunos modelos de la competencia, así como chips que no son tan poderosos como los disponibles para las empresas estadounidenses de inteligencia artificial. DeepSeek demuestra así que una IA altamente inteligente con capacidad de razonar no tiene por qué ser muy costosa de entrenar o utilizar.



[ad_2]

Source Article Link

Categories
News

New open source AI coding assistant DeepSeek released

DeepSeek LLM open source AI coding assistant

Developers, coders and enthusiasts may be interested in a new open source AI coding assistant model in the form of the DeepSeek large language model (LLM).  DeepSeek, a company that’s been working under the radar, has recently released an open-source coding model that’s making waves in the tech community. This model, known as the DeepSeek coder model, boasts an impressive 67 billion parameters, putting it in the same league as some of the most advanced AI models out there, like GPT-4.   The open source AI coding assistant has been trained from scratch on a vast dataset in both English and Chinese.

  • Superior General Capabilities: DeepSeek LLM 67B Base outperforms Llama2 70B Base in areas such as reasoning, coding, math, and Chinese comprehension.

  • Proficient in Coding and Math: DeepSeek LLM 67B Chat exhibits outstanding performance in coding (HumanEval Pass@1: 73.78) and mathematics (GSM8K 0-shot: 84.1, Math 0-shot: 32.6). It also demonstrates remarkable generalization abilities, as evidenced by its exceptional score of 65 on the Hungarian National High School Exam.

  • Mastery in Chinese Language: Based on our evaluation, DeepSeek LLM 67B Chat surpasses GPT-3.5 in Chinese.

What makes the DeepSeek coder model stand out is its extensive training on a dataset comprising two trillion tokens. This vast amount of data has given the model a wide-ranging understanding and knowledge base, allowing it to perform at levels that exceed Llama 2’s 70 billion base model and show competencies akin to GPT-3.5. This achievement has quickly made it a notable competitor in the AI landscape.

But DeepSeek didn’t stop there. They’ve been continuously improving their model. With the release of version 1.5, they’ve added an extra 1.4 trillion tokens of coding data to the model’s training, which has significantly enhanced its capabilities. This upgrade means that the DeepSeek coder model is now even more adept at handling complex tasks, such as natural language programming and mathematical reasoning. It’s become an essential tool for those who need to simplify intricate processes.

DeepSeek open source AI coding assistant

“We release the DeepSeek LLM 7B/67B, including both base and chat models, to the public. To support a broader and more diverse range of research within both academic and commercial communities, we are providing access to the intermediate checkpoints of the base model from its training process. Please note that the use of this model is subject to the terms outlined in License section. Commercial usage is permitted under these terms.”

The model’s versatility is also worth mentioning once again as it supports multiple languages, including Chinese, which opens up its benefits to a wider, international audience. This is particularly important as the demand for advanced AI technology grows across different regions and industries.

DeepSeek LLM vs LLaMA 2

DeepSeek open source AI coding model benchmarking

For those interested in using the DeepSeek AI coding assistant, it’s readily available on platforms like Hugging Face and LM Studio.and is available to download in both 7 Billion and 33 Billion versions. This accessibility ensures that users who need cutting-edge AI can easily integrate it into their work. The model’s technical capabilities are further showcased by its ability to predict the next token in a sequence with a window size of 4K, which means it can produce outputs that are more nuanced and aware of the surrounding context. Additionally, the model has been fine-tuned on 2 billion tokens of instruction data, which guarantees that it can understand and carry out complex instructions with remarkable accuracy.

The research and development team responsible for creating this unique advanced language model comprising of 67 billion parameters have future plans for its development, and the DeepSeek AI coding assistant is likely just the start of their journey. They’ve hinted at future developments that could redefine the limits of AI models. This suggests that we can expect more innovative tools from DeepSeek that will continue to shape the future of various industries and applications.

The DeepSeek coder model is a significant step forward in the realm of open-source AI technology. With its advanced features and strong performance, it’s an excellent option for anyone in need of an AI model that specializes in coding and mathematics. As the AI community continues to expand, the DeepSeek coder model stands as a prime example of the kind of innovative, powerful, and adaptable tools that are driving progress across different fields. To give the AI coding assistant try jump over to the official DeepSeek Alpha website.

Filed Under: Gadgets News





Latest timeswonderful Deals

Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, timeswonderful may earn an affiliate commission. Learn about our Disclosure Policy.

Categories
News

Deepseek Coder vs CodeLlama vs Claude vs ChatGPT AI coding

Deepseek Coder vs CodeLlama vs Claude vs ChatGPT AI coding assistants compared

If you are looking for an artificial intelligent AI coding assistant you might be interested in learning more about a new AI model which is showing excellent results when compared to others such as CodeLlama. The world of artificial intelligence is changing on a daily basis and Deepseek Coder, an AI model and coding assistant developed by a team of researchers in Beijing, is setting new standards in the field. This model has outshone its competitors, including the well-known CodeLlama, in various benchmarks, showcasing its superior capabilities.

One of the most impressive aspects of the Deepseek Coder is its scalable architecture. It comes in three different sizes, with 1 billion, 7 billion, and 33 billion parameters, making it versatile enough to handle a wide range of applications. The smallest version is perfect for edge devices or quick GPU inference tasks, which is a big step forward for edge computing, where AI needs to be both practical and efficient.

Another area where Deepseek Coder excels is its licensing model. Unlike other AI models that come with restrictive licensing, Deepseek Coder offers a more permissive approach. This means it can be used for both open-source projects and commercial purposes, giving developers and businesses more freedom to innovate and expand their use of AI.

Deepseek Coder vs CodeLlama vs Claude vs ChatGPT

Here are some other articles you may find of interest on the subject of AI coding assistants and tools:

When it comes to integrating and deploying AI models, the format of the prompts used can make a big difference. Deepseek Coder’s intuitive prompt design makes it easy to call functions and perform context-aware inference. This is especially useful for creating AI chat interfaces that are user-friendly or for integrating with platforms like Runpod.

The model’s GPU inference efficiency is another standout feature. It ensures quick and effective processing, which is essential for commercial AI applications that require real-time interaction and high throughput. Deepseek Coder also excels in handling long context inference. This is crucial for generating coherent and contextually accurate responses during interactions. The model’s ability to manage long contexts is a testament to its advanced design and the thorough analysis by its creators.

For developers looking to integrate AI into their systems, Deepseek Coder’s function calling feature is a game-changer. It simplifies the integration process, improving the overall developer experience. This is complemented by Trellis fine-tuned models, which are optimized for specific tasks and industries, ensuring top-notch performance.

To help with the adoption of Deepseek Coder, there are several resources available. The Hugging Face repository provides pre-trained models and a space for community contributions. For those who prefer a more hands-on approach, Google Colab offers collaborative notebooks that are perfect for experimentation and development.

Runpod’s AI templates are another resource that can be incredibly helpful. They provide a seamless deployment process with cloud computing environments that are ready to use, which is a great advantage for developers who want to get their AI projects off the ground quickly.

The fine-tuning capabilities of Deepseek Coder are comparable to those of Llama models. This allows for customization to suit the specific needs of your datasets and applications, giving you the flexibility to tailor the AI to your requirements.

Overall, the Deepseek Coder is a powerful tool in the world of AI innovation. With its scalable design, flexible licensing, advanced features, and a wealth of resources for developers, it is well-equipped to help professionals in various industries explore and push the boundaries of AI technology.

Filed Under: Guides, Top News





Latest timeswonderful Deals

Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, timeswonderful may earn an affiliate commission. Learn about our Disclosure Policy.

Categories
News

Deepseek Coder AI open source coding assistant

Deepseek Coder open source AI coding assistant runs online and locally

If you could do with a little assistance when coding or when learning a new coding language you might be interested in a new AI coding assistant in the form of Deepseek Coder. The AI coding assistant has been created using a series of code language models trained on both 87% code and 13% natural language in English and Chinese, with each model pre-trained on 2T tokens.

Various sizes of the Deepseek Coder AI coding assistant are available from 1B to 33B versions. The advanced coding application harnesses the power of artificial intelligence (AI) to streamline the software development process. The Deepseek Coder AI coding tool operates on an extendable framework, enabling scalability and adaptability to a wide array of project requirements. Deepseek is a web-based application, making it accessible via the internet from anywhere. However, the application can also be accessed locally, offering flexibility to developers who prefer or need to work offline.

Deepseek Coder training

Deepseek Coder also offers efficient code generation capabilities using multiple communicative agents to facilitate software communication, thereby enhancing the efficiency of code creation. This feature allows developers to generate code more quickly and accurately, reducing the time and effort required in the development process. One of the key features of Deepseek Coder is its pre-trained model offering AI coding assistant for 80 programming languages, including popular ones like Python and JavaScript. The application also supports project-level code completion and code infilling, further enhancing its utility for developers.

Features of Deepseek Coder AI coding assistant

– Pretrained on 2 Trillion tokens over more than 80 programming languages.
– Various model sizes (1.3B, 5.7B, 6.7B and 33B) to support different requirements.
– A window size of 16K window** size, supporting project-level code completion and infilling.
– State-of-the-Art performance among open code models.
– Open source and free for research and commercial use.

Other articles you may find of interest on the subject of AI coding assistance :

The AI model is an open-source application, meaning it is free-to-use software that can be modified and distributed by anyone. This open-source nature fosters a collaborative environment where developers can contribute to the application’s improvement. The application can be used for free for research purposes, and it also supports commercial use cases. The capabilities of Deepseek Coder have been tested in various scenarios to ensure its effectiveness and reliability. For instance, it has been used to create a snake game within seconds, demonstrating its potential in game development. The application’s performance has also been compared with other open-source code models such as ChatDev, with promising results.

Benefits of using an AI coding assistant

  • Efficiency Improvement: AI coding assistants can significantly speed up the coding process by suggesting code snippets, completing lines of code, and automating repetitive tasks.
  • Error Reduction: They help in detecting and correcting syntax errors, code smells, and even identifying potential bugs before runtime, which leads to cleaner code and fewer errors.
  • Learning and Upskilling: Coders can learn from AI suggestions, discovering new functions, libraries, and coding patterns that they might not be familiar with.
  • Code Refactoring: AI assistants can suggest improvements and optimizations to existing code, making it more maintainable and performant.
  • Language Agnosticism: Many AI coding assistants support multiple programming languages, allowing developers to switch between projects without losing productivity.
  • Accessibility: They make coding more accessible to beginners by providing inline documentation and explanations, thus lowering the barrier to entry for programming.
  • Integrations: These tools often integrate with IDEs and other development tools, creating a seamless development environment.
  • Documentation Assistance: They can help generate comments and documentation, ensuring that the codebase is understandable and easier to maintain.
  • Resource Optimization: By automating certain tasks, they free up human resources to focus on more complex and creative aspects of software development.
  • 24/7 Availability: Unlike human counterparts, AI coding assistants are available around the clock, providing assistance whenever needed.

Access and availability

Deepseek Coder can be accessed from the official website or can be downloaded from GitHub for local installation. The website provides easy access and start-up, while local installation allows developers to run the application on their own machines. The application can also be run on LM Studio, an AI platform that supports open-source models. To learn more about LM studio check out our previous articles.

Deepseek Coder is an advanced coding application that utilizes AI to streamline the software development process. Its extendable framework, efficient code generation capabilities, and pre-trained model make it a powerful tool for developers. As an open-source application, it fosters collaboration and is accessible to all. With its impressive capabilities, Deepseek Coder is set to make a significant impact in the world of software development.

Filed Under: Guides, Top News





Latest timeswonderful Deals

Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, timeswonderful may earn an affiliate commission. Learn about our Disclosure Policy.