Claude 3.5: Varias Novedades en IA y uso de ordenadores

Anthropic actualiza Claude 3.5 Sonnet con mejoras en codificación y lanza Claude 3.5 Haiku, junto con una beta innovadora para el uso de computadoras.

Anthropic ha lanzado importantes actualizaciones y novedades en su línea de modelos de inteligencia artificial Claude. La compañía presentó una versión mejorada de Claude 3.5 Sonnet, que ofrece avances significativos en comparación con su predecesor.

Además, Anthropic anunció un nuevo modelo llamado Claude 3.5 Haiku, que se destaca por su velocidad y costo accesible, junto con la introducción en beta pública de una revolucionaria capacidad de «uso de computadoras», que permite a los desarrolladores dirigir a Claude para que interactúe con las computadoras como lo haría un ser humano.

Impressionism - DALL-E — *Imagen DALL-E*

Claude 3.5 Sonnet: Potencia en Codificación

La versión mejorada de Claude 3.5 Sonnet muestra avances en todas las áreas, pero destaca especialmente en tareas de codificación. Según las pruebas realizadas por Anthropic, este modelo supera a todos los sistemas disponibles públicamente en benchmarks específicos de codificación y uso de herramientas.

En el SWE-bench Verified, por ejemplo, su desempeño aumentó de un 33.4% a un 49.0%, superando incluso a sistemas especializados en codificación avanzada como OpenAI o1-preview.

La empresa ha colaborado con clientes como GitLab para probar el nuevo modelo en tareas de DevSecOps, reportando un incremento del 10% en habilidades de razonamiento sin aumentar la latencia. Otros clientes, como Cognition, han experimentado mejoras notables en codificación, planificación y resolución de problemas, superando los resultados de versiones anteriores.

Claude 3.5 Haiku: Rapidez y Eficiencia

El nuevo modelo Claude 3.5 Haiku, que se lanzará este mes, es una actualización del modelo más rápido de la generación anterior. Comparado con Claude 3 Opus, Haiku 3.5 iguala su rendimiento en múltiples evaluaciones, pero con un costo y velocidad similares a su predecesor, Claude 3 Haiku.

Se ha destacado en tareas de codificación, obteniendo un 40.6% en el SWE-bench Verified, superando a agentes que usan modelos públicos de última generación.

Anthropic planea que este modelo esté disponible en su API, en Amazon Bedrock y en Vertex AI de Google Cloud. Inicialmente será solo de texto, pero se espera que se amplíe a entradas de imágenes en el futuro cercano.

Uso de Computadoras: Innovación en Beta Pública

La funcionalidad más revolucionaria que Anthropic ha lanzado en beta pública es la capacidad de uso de computadoras. Claude 3.5 Sonnet es el primer modelo que permite a los desarrolladores dirigir sus acciones para interactuar con una computadora, utilizando interfaces de usuario y software estándar de la misma forma que lo haría una persona. Esta característica permite automatizar procesos repetitivos, probar software y realizar tareas abiertas como la investigación.

Anthropic menciona que la beta aún es experimental y presenta ciertas limitaciones, como la dificultad para realizar acciones aparentemente sencillas como desplazar, arrastrar o hacer zoom. No obstante, empresas como Asana, Canva y Replit ya han comenzado a probar esta función para optimizar sus procesos y productos, indicando que podría tener un impacto significativo en la eficiencia y la autonomía de la inteligencia artificial.

Seguridad y Ética en el Uso de IA

Para garantizar un uso seguro de estas nuevas capacidades, Anthropic ha realizado pruebas en conjunto con el Instituto de Seguridad en IA de EE.UU. (US AISI) y su homólogo en el Reino Unido (UK AISI). La compañía sigue aplicando sus políticas de escalado responsable y ha evaluado los riesgos potenciales de este modelo.

Además, han desarrollado clasificadores que permiten identificar el uso de la capacidad de uso de computadoras y detectar cualquier actividad malintencionada.

Mirando al Futuro

Anthropic se muestra entusiasta sobre las posibilidades que abrirán estas nuevas herramientas y capacidades. La compañía ha invitado a los desarrolladores a experimentar con la beta pública y a proporcionar retroalimentación para mejorar y evolucionar la tecnología. El objetivo a largo plazo es entender mejor el potencial y las implicaciones de sistemas de IA cada vez más avanzados.

La nueva versión de Claude 3.5 Sonnet ya está disponible para todos los usuarios a través de la API de Anthropic, Amazon Bedrock y Vertex AI de Google Cloud. Por su parte, Claude 3.5 Haiku se lanzará en las próximas semanas, continuando con la expansión y mejora de la línea de productos de inteligencia artificial de Anthropic.

Anthropic sigue como uno de los líderes en el desarrollo de inteligencia artificial avanzada con sus modelos Claude 3.5 Sonnet y Haiku. Las mejoras en codificación y la innovación en el uso de computadoras representan un paso significativo hacia la automatización de tareas complejas, ofreciendo a los desarrolladores nuevas herramientas para explorar el potencial de estas tecnologías.

Relacionado

Anthropic Lanza la Nueva Aplicación Claude en iOS para Transformar la Productividad Móvil

En un importante desarrollo dentro del campo de la inteligencia artificial, Anthropic ha anunciado hoy el lanzamiento de su nueva aplicación Claude para iOS. Esta aplicación Claude para iOS fue diseñada para llevar las capacidades avanzadas de IA directamente a los dispositivos móviles de los usuarios. Anthropic señala que su…

1 mayo 2024

En «Aplicaciones»

Claude 2.1 de Anthropic, un chat conversacional más potente y seguro

Anthropic, la empresa de inteligencia artificial (IA) fundada por ex ingenieros de Google, ha lanzado Claude 2.1, una versión actualizada de su IA conversacional. La nueva versión Claude 2.1 incluye una serie de mejoras que la hacen más potente y segura, lo que la hace más adecuada para su uso…

22 noviembre 2023

En «Inteligencia Artificial»

IA de Anthropic Alcanza Nueva Cota de Conciencia: Claude 3 Opus y el Desafío de la Aguja en el Pajar

En un giro sorprendente y revelador dentro del campo de la inteligencia artificial, un integrante del equipo detrás de Claude 3 Opus, la última iteración de los modelos de lenguaje de Anthropic, ha compartido una anécdota fascinante que destaca las capacidades mejoradas y la conciencia emergente de esta tecnología. Durante…

5 marzo 2024

En «Inteligencia Artificial»

Siguenos por Twitter a través de @Geeksroom y no te pierdas todas las noticias, cursos gratuitos y demás artículos. También puedes seguirnos a través de nuestro canal de Youtube para ver nuestros vídeos, a través de Instagram para ver nuestras imágenes! O vía Bluesky si ya estás cansado de Twitter

Cookie	Duración	Descripción
AWSALBCORS	7 days	Amazon Web Services set this cookie for load balancing.
consentUUID	1 year	This cookie is used as a unique identification for the users who has accepted the cookie consent box.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Advertisement" category.
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	CookieYes sets this cookie to record the default button state of the corresponding category and the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
_csrf	session	This cookie is essential for the security of the website and visitor. It ensures visitor browsing security by preventing cross-site request forgery.

Cookie	Duración	Descripción
na_id	1 year 1 month	The na_id is set by AddThis to enable sharing of links on social media platforms like Facebook and Twitter.
na_rn	1 month	The na_rn cookie is used to recognize the visitor upon re-entry. It allows to record details on user behaviour and facilitate the social sharing function provided by Addthis.com.
na_sc_e	1 month	The na_sc_e cookie is used to recognize the visitor upon re-entry. It allows to record details on user behaviour and facilitate the social sharing function provided by Addthis.com.
na_sr	1 month	The na_sr cookie is used to recognize the visitor upon re-entry. It allows to record details on user behaviour and facilitate the social sharing function provided by Addthis.com.
na_srp	1 minute	The na_srp cookie is used to recognize the visitor upon re-entry. It allows to record details on user behaviour and facilitate the social sharing function provided by Addthis.com.
na_tc	1 year 1 month	The na_tc cookie is used to recognize the visitor upon re-entry. It allows to record details on user behaviour and facilitate the social sharing function provided by Addthis.com.
ouid	1 year 1 month	Associated with the AddThis widget, this cookie helps users to share content across various networking and sharing forums.
__cf_bm	30 minutes	Cloudflare set the cookie to support Cloudflare Bot Management.

Cookie	Duración	Descripción
AWSALB	7 days	AWSALB is an application load balancer cookie set by Amazon Web Services to map the session to the target.
d	3 months	Quantserve sets this cookie to anonymously track information on how visitors use the website.

Cookie	Duración	Descripción
ANON_ID	3 months	This cookie, set by Tribal Fusion, collects data on user visits to the website, such as what pages have been accessed .
CONSENT	2 years	YouTube sets this cookie via embedded YouTube videos and registers anonymous statistical data.
suid	1 year	Simpli. fi sets this cookie to store a distinct session ID.
u	1 year	This cookie is used by Bombora to collect information that is used either in aggregate form, to help understand how websites are being used or how effective marketing campaigns are, or to help customize the websites for visitors.
uid	2 months	This is a Google UserID cookie that tracks users across various website segments.
_ga	1 year 1 month 4 days	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_gat_gtag_UA_*	1 minute	Google Analytics sets this cookie to store a unique user ID.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
_gid	1 day	Google Analytics sets this cookie to store information on how visitors use a website while also creating an analytics report of the website's performance. Some of the collected data includes the number of visitors, their source, and the pages they visit anonymously.
__gads	1 year 24 days	Google sets this cookie under the DoubleClick domain, tracks the number of times users see an advert, measures the campaign's success, and calculates its revenue. This cookie can only be read from the domain they are currently on and will not track any data while they are browsing other sites.

Cookie	Duración	Descripción
A3	1 year	Yahoo set this cookie for targeted advertising.
ab	1 year	Owned by agkn, this cookie is used for targeting and advertising purposes.
anj	3 months	AppNexus sets the anj cookie that contains data stating whether a cookie ID is synced with partners.
ANON_ID_old	3 months	This cookie helps to categorise the users interest and to create profiles in terms of resales of targeted marketing. This cookie is used to collect user information such as what pages have been viewed on the website for creating profiles.
cid_*	1 year	Crimtan sets this cookie as remarketing cookie that is used to send relevant ads to users on subsequent sites.
CMID	1 year	Casale Media sets this cookie to collect information on user behaviour for targeted advertising.
CMPRO	3 months	CasaleMedia sets CMPRO cookie for anonymous usage tracking and targeted advertising.
CMPS	3 months	CasaleMedia sets CMPS cookie for anonymous user tracking based on users' website visits to display targeted ads.
DSID	1 hour	This cookie is set by DoubleClick to note the user's specific user identity. It contains a hashed/encrypted unique ID.
everest_g_v2	1 year	The cookie is set under the everesttech.net domain to map clicks to other events on the client's website.
gid_*	1 year	Crimtan sets this cookie to enable targeted advertising and user profiling.
GoogleAdServingTest	session	Google sets this cookie to determine what ads have been shown to the website visitor.
IDE	1 year 24 days	Google DoubleClick IDE cookies store information about how the user uses the website to present them with relevant ads according to the user profile.
mc	1 year 1 month	Quantserve sets the mc cookie to track user behaviour on the website anonymously.
mt_mop	1 month	MediaMath uses this cookie to synchronize the visitor ID with a limited number of trusted exchanges and data partners.
pxrc	2 months	This cookie is set by pippio to provide users with relevant advertisements and limit the number of ads displayed.
rlas3	1 year	RLCDN sets this cookie to provide users with relevant advertisements and limit the number of ads displayed.
suid_legacy	1 year	Collects information on user preferences and interaction with web-campaign content which is used on CRM-campaign-platforms used by website owners for promoting events or products.
test_cookie	15 minutes	doubleclick.net sets this cookie to determine if the user's browser supports cookies.
UserID1	3 months	Adition sets this cookie as a unique anonymous ID for a website visitor. This ID is used to identify the user across sessions and to track their activity on the website. The data collected is used for analysis purposes.
uuid	1 year 27 days	MediaMath sets this cookie to avoid the same ads from being shown repeatedly and for relevant advertising.
uuid2	3 months	The uuid2 cookie is set by AppNexus and records information that helps differentiate between devices and browsers. This information is used to pick out ads delivered by the platform and assess the ad performance and its attribute payment.
wfivefivec	1 year 1 month 1 day	W55c sets this cookie to collect data on the user's visits to the website, such as what pages have been loaded. The registered data is used for targeted ads.
_gu	1 month	GetSiteControl sets this cookie to track user information from marketing campaigns.
__gpi	1 year 24 days	Google Ads Service uses this cookie to collect information about from multiple websites for retargeting ads.

Tecnología, Movilidad y Estilo de vida.

Claude 3.5 con Importantes Novedades y la IA Toma el Control de Ordenadores