
News

ETH Zurich and Microsoft Scientists Present SliceGPT for Enhanced Compression of Extensive Language Models via Sparsification

Large language models (LLMs) like GPT-4 require considerable computational power and memory, making their efficient deployment challenging. Techniques like sparsification have been developed to reduce these demands, but they can complicate system architecture and deliver only partial speedups given the limitations of current hardware. Compression methods for LLMs such as sparsification, low-rank approximation,…
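
To make the idea of sparsification concrete, here is a minimal sketch of generic magnitude pruning in Python (NumPy assumed); it zeroes the smallest-magnitude weights of a layer. This illustrates sparsification in general, not SliceGPT's own algorithm.

    import numpy as np

    def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
        # Zero out the smallest-magnitude entries of a weight matrix.
        # Generic unstructured sparsification, for illustration only;
        # SliceGPT itself uses a different compression scheme.
        threshold = np.quantile(np.abs(weights), sparsity)
        return np.where(np.abs(weights) < threshold, 0.0, weights)

    # Prune 90% of a random 4096x4096 layer. The zeros are what sparse
    # kernels would skip; turning them into real wall-clock speedup
    # depends on hardware support, the limitation noted above.
    w = np.random.randn(4096, 4096).astype(np.float32)
    w_sparse = magnitude_prune(w, sparsity=0.9)
    print(f"fraction zeroed: {(w_sparse == 0).mean():.2f}")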

Read More

Decoding the Brain’s Response to Language: The Role of GPT Models in Forecasting and Modifying Neural Activity

Recent advances in machine learning (ML) and artificial intelligence (AI) are being applied across numerous fields thanks to increased computing power, extensive data access, and improved ML techniques. Researchers from MIT and Harvard University have used these advancements to shed light on the brain's reaction to language, using AI models to trigger and suppress responses…

Read More

This AI Study Proposes the Investigate-Consolidate-Exploit (ICE) Approach: A Fresh AI Method to Enhance the Agent’s Self-Adaptation Between Tasks

The AI and machine learning fields are witnessing a revolutionary development: intelligent agents capable of adapting and evolving by incorporating past experiences into diverse tasks. These agents, crucial to AI advancement, are designed to carry out tasks effectively and to learn and improve continuously, increasing their adaptability across different situations. A key challenge is the efficient…

Read More

Scientists at the University of Kentucky Propose MambaTab: A Novel Mamba-Based Machine Learning Technique for Managing Tabular Data

Tabular data is widely used across industry, healthcare, and academia thanks to its simplicity and interpretability. However, traditional and deep learning models that process this data type often require extensive preprocessing and significant computational resources. Researchers from the University of Kentucky have introduced MambaTab, a new method leveraging a…

Read More

Introducing DrugAssist: A User-Interactive Model for Molecule Optimization that Utilizes Real-Time Human Interaction through Natural Language

Recent advances in Large Language Models (LLMs) have allowed for significant progress in language processing and various other fields. Despite these advancements, LLMs have not been highly impactful in molecule optimization, a crucial component of drug discovery. Traditional methods focus on patterns in chemical structure data rather than incorporating expert feedback, resulting in gaps in…

Read More

Researchers at New York University build an AI that learns from a child's perspective

Researchers from New York University (NYU) modeled an AI training approach on how children learn. Published in the journal Science, the method allows the AI system to learn from its surrounding environment instead of relying heavily on labeled data. To achieve this, researchers gathered a dataset…

Read More

Want Faster, More Efficient AI? Meet FP6-LLM: A Breakthrough in GPU-Based Quantization for Large Language Models

In the fields of artificial intelligence and computational linguistics, experts constantly strive to optimize the performance of Large Language Models (LLMs) like GPT-3. These models, with their capacity to handle numerous language-based tasks, present a major challenge due to their size. For instance, with its 175 billion parameters, GPT-3 requires a significant amount of GPU…
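
For a sense of scale, the back-of-the-envelope arithmetic below (a sketch, not taken from the paper) shows the weights-only memory footprint of a 175-billion-parameter model at several precisions:

    # Weights-only footprint of a 175B-parameter model; activations,
    # KV cache, and optimizer state are excluded.
    PARAMS = 175e9

    def weight_gb(bits_per_param: int) -> float:
        return PARAMS * bits_per_param / 8 / 1e9  # bits -> bytes -> GB

    for name, bits in [("FP16", 16), ("INT8", 8), ("FP6", 6), ("INT4", 4)]:
        print(f"{name:>5}: {weight_gb(bits):6.1f} GB")

At FP16 the weights alone come to about 350 GB, far more than any single GPU holds; 6-bit storage cuts the same weights to roughly 131 GB.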

Read More

Pursuing Speed without Compromise in Large Language Models? Introducing EAGLE: A Machine Learning Framework Setting New Benchmarks for Lossless Acceleration

Auto-regressive decoding is the gold standard in Large Language Models (LLMs), but the process can be quite time-consuming and costly. An approach called speculative sampling has emerged to resolve this: a smaller model efficiently drafts candidate tokens, which the full LLM then verifies in parallel, significantly improving speed. However, the speed gains from speculative sampling often come at an accuracy…
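
As a rough illustration of the draft-and-verify idea (not EAGLE's specific method), the toy loop below drafts a few tokens with a cheap stand-in model and checks them against a stand-in verifier; draft_model, target_accepts, and the vocabulary are all hypothetical placeholders.

    import random

    VOCAB = list("abcde")

    def draft_model(context: str) -> str:
        return random.choice(VOCAB)        # cheap proposer (placeholder)

    def target_accepts(context: str, token: str) -> bool:
        return random.random() < 0.7       # stand-in acceptance test

    def speculative_step(context: str, k: int = 4) -> str:
        drafts = []
        for _ in range(k):                 # 1. draft k tokens cheaply
            drafts.append(draft_model(context + "".join(drafts)))
        accepted = []
        for t in drafts:                   # 2. verify against the target
            if not target_accepts(context + "".join(accepted), t):
                break                      # 3. stop at the first rejection
            accepted.append(t)
        # A real decoder would verify all k drafts in one batched target
        # pass and, on rejection, resample from the target's adjusted
        # distribution so output quality is preserved.
        return context + "".join(accepted)

    print(speculative_step("hello "))

The more drafts the verifier accepts per pass, the fewer expensive target-model steps are needed, which is where the speedup comes from.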

Read More

US legislators propose the DEFIANCE Act to address harmful deepfakes

US legislators have introduced a bill called the Disrupt Explicit Forged Images and Non-Consensual Edits (DEFIANCE) Act in response to mounting concern over AI-generated explicit images. Singer Taylor Swift, for instance, was recently the victim of such AI-manufactured images. The incident stirred outraged reactions from the general public and from Swift's fans (who initiated a widespread social…

Read More

Mastercard develops an AI model using generative techniques to combat fraud

Mastercard is set to enhance its real-time fraud detection capabilities with the launch of an innovative AI tool called Decision Intelligence Pro (DI Pro). Developed by the company's cybersecurity and anti-fraud divisions, DI Pro employs a recurrent neural network to identify fraudulent transactions. This system is constantly learning, processing billions of Mastercard transactions annually to…
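
As a generic sketch of the idea (DI Pro's internals are not public, so the feature count, hidden size, and GRU choice below are illustrative assumptions), a recurrent network can score a card's recent transaction sequence for fraud risk:

    import torch
    import torch.nn as nn

    class FraudScorer(nn.Module):
        def __init__(self, n_features: int = 16, hidden: int = 64):
            super().__init__()
            self.rnn = nn.GRU(n_features, hidden, batch_first=True)
            self.head = nn.Linear(hidden, 1)

        def forward(self, txn_seq: torch.Tensor) -> torch.Tensor:
            # txn_seq: (batch, seq_len, n_features), one row per transaction
            _, h = self.rnn(txn_seq)
            return torch.sigmoid(self.head(h[-1]))  # fraud probability

    model = FraudScorer()
    batch = torch.randn(8, 20, 16)  # 8 cards, 20 recent transactions each
    print(model(batch).shape)       # torch.Size([8, 1])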

Read More

DeepSeek-AI Launches DeepSeek-Coder Series: An Array of Open-Source Coding Models from 1.3B to 33B, Entirely Trained on 2T Tokens

In the continually evolving field of software development, large language models (LLMs) have brought about notable changes, particularly in the sector of code intelligence. These advanced models have played a vital role in automating several aspects of coding such as locating bugs and generating code. This innovation in approach and execution of coding tasks has…

Read More