Large Language Models (LLMs) have become increasingly important in AI and data processing tasks, but their sheer size leads to substantial memory requirements and bandwidth consumption. Standard techniques such as Post-Training Quantization (PTQ) and Quantized Parameter-Efficient Fine-Tuning (Q-PEFT) often compromise accuracy and performance, and become impractical for larger networks. To combat this, researchers have…
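At its core, PTQ maps a trained model's floating-point weights onto a low-bit integer grid after training, with no retraining. A minimal sketch of symmetric per-tensor int8 quantization (illustrative only, not the scheme of any particular paper or library):

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric per-tensor int8 quantization: scale so the largest
    magnitude maps to 127, then round to the nearest integer."""
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float weights from the int8 tensor."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 4)).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
```

With round-to-nearest, the reconstruction error per weight is at most half a quantization step (`s / 2`), which is why accuracy degrades as bit width shrinks.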
Language models (LMs), used in applications such as autocomplete and language translation, are trained on vast amounts of text data. Yet these models face significant privacy and copyright challenges: the inadvertent inclusion of private or copyrighted content in training datasets can lead to legal and ethical…
Researchers at the University of Texas (UT) at Austin have introduced a new benchmark designed to evaluate how effectively artificial intelligence can solve complex mathematical problems. PUTNAMBENCH addresses a key gap in the field: current benchmarks are insufficiently rigorous and focus mainly on high-school-level mathematics.
Automating mathematical reasoning…
DeepSeek has announced the launch of its advanced open-source AI model, DeepSeek-V2-Chat-0628, on Hugging Face. The update represents a significant advancement in AI text generation and chatbot technology. This new version ranks #11 overall on the LMSYS Chatbot Arena Leaderboard, outperforming all other existing open-source models. It is an upgrade on…
AI chatbots pose unique safety risks: while they can write computer programs or provide useful summaries of articles, they can also generate harmful or even illegal instructions, such as how to build a bomb. To address such risks, companies typically use a process called red-teaming, in which human testers try to elicit unsafe or toxic content from AI…
A research team from MIT, the Broad Institute of MIT and Harvard, and Massachusetts General Hospital has developed an artificial intelligence (AI) tool, named Tyche, that presents multiple plausible interpretations of medical images, highlighting potentially important and varied insights. The tool aims to address the ambiguity inherent in medical image interpretation, where different experts…
Processing visual information effectively is a key step towards achieving Artificial General Intelligence (AGI). Despite much progress in artificial intelligence, conventional Visual Question Answering (VQA) systems remain limited by their inability to process and reason over more than one image at a time. The “Multi-Image Question Answering” (MIQA) task seeks…
Large Language Models (LLMs) are vital for natural language processing tasks, but their substantial computational and memory requirements during inference make them difficult to deploy. Current research focuses on boosting LLM efficiency through methods such as quantization, pruning, distillation, and improved decoding. One of…
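Of the methods listed, unstructured magnitude pruning is perhaps the simplest to illustrate: zero out the fraction of weights with the smallest absolute values, shrinking the effective model. A rough NumPy sketch (an illustration of the general idea, not any specific system's implementation):

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude `sparsity` fraction of weights."""
    k = int(weights.size * sparsity)
    if k == 0:
        return weights.copy()
    # Threshold = k-th smallest absolute value across the whole tensor.
    threshold = np.sort(np.abs(weights), axis=None)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 8))
pruned = magnitude_prune(w, 0.5)  # half the entries become exactly zero
```

In practice, pruned models are usually fine-tuned afterwards to recover accuracy, and the zeros only save compute when paired with sparse kernels or structured sparsity patterns.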
Generative artificial intelligence (AI) technologies, like Large Language Models (LLMs), are showing promise in areas like programming processes, customer service productivity, and collaborative storytelling. However, their impact on human creativity, a cornerstone of our behavior, is still somewhat unknown. To investigate this, a research team from the University College London and the University of Exeter…
Large language models (LLMs) are being used extensively across applications. However, they have a significant limitation: the constraints of transformer-based architectures make long-context tasks hard to process. Researchers have explored various approaches to extend LLMs' context-processing capabilities, including improving softmax attention, reducing computational costs, and refining positional encodings. Techniques…
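The long-context bottleneck stems from standard softmax attention, whose score matrix grows quadratically with sequence length. A minimal NumPy sketch of scaled dot-product attention showing where that cost arises (illustrative: single head, no masking or batching):

```python
import numpy as np

def softmax(x: np.ndarray, axis: int = -1) -> np.ndarray:
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q: np.ndarray, K: np.ndarray, V: np.ndarray) -> np.ndarray:
    """Scaled dot-product attention over n tokens of dimension d."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)   # (n, n) matrix: the quadratic bottleneck
    return softmax(scores) @ V      # weighted mix of value vectors

rng = np.random.default_rng(0)
n, d = 16, 8
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))
out = attention(Q, K, V)
```

Doubling the context length quadruples the size of the `(n, n)` score matrix, which is precisely what the approaches mentioned above try to tame.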
Researchers from the Shanghai AI Laboratory and Tsinghua University have developed NeedleBench, a novel framework to evaluate the retrieval and reasoning capabilities of large language models (LLMs) in exceedingly long contexts (up to 1 million tokens). The tool is critical for real-world applications such as legal document analysis, academic research, and business intelligence, which rely…
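Benchmarks of this kind are often described as “needle-in-a-haystack” tests: a key fact is buried inside long filler text and the model must retrieve it. A toy sketch of the idea (hypothetical filler and needle, with a trivial substring matcher standing in for an LLM; not NeedleBench's actual data or protocol):

```python
def build_haystack(needle: str, n_sentences: int, position: float) -> str:
    """Insert a single 'needle' sentence into repetitive filler text."""
    filler = ["The sky was clear that day."] * n_sentences
    filler.insert(int(n_sentences * position), needle)
    return " ".join(filler)

def toy_model_answer(context: str, keyword: str) -> str:
    """Stand-in for an LLM call: naive retrieval by substring match."""
    for sentence in context.split(". "):
        if keyword in sentence:
            return sentence
    return ""

needle = "The secret code is 4711."
context = build_haystack(needle, 1000, position=0.5)  # needle at the midpoint
answer = toy_model_answer(context, "secret code")
```

A real evaluation sweeps the needle's position and the context length, scoring how often the model's answer still contains the buried fact as the haystack grows toward a million tokens.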
Users of offline web and AI apps often face several hurdles. Getting an app up and running typically takes multiple steps, and those who aren't technically proficient may find the process confusing and lengthy. Furthermore, managing and customizing these apps often requires manual file editing, exacerbating the problem.
However, the introduction…