Artificial Intelligence Archives - Page 101 of 233

Flexible Architectural Neural Networks: Employing AI Solutions to Address Symmetric Issues in Optimization of Units and Common Parameters

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Machine learning, Staff, Tech News, Technology, UncategorizedJune 2, 2024195Views 0Likes 0Comments

Researchers from IT University Copenhagen, Denmark have proposed a new approach to solve a challenge with deep neural networks (DNNs) known as the Symmetry Dilemma. This issue arises because standard DNNs have a fixed structure tied to specific dimensions of input and output space. This rigid structure makes it difficult to optimize these networks across…

Adaptive Visual Tokenization in Matryoshka Multimodal Models: Boosting Efficacy and Versatility in Multimodal Machine Learning

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Machine learning, Staff, Tech News, Technology, UncategorizedJune 2, 2024218Views 0Likes 0Comments

Multimodal machine learning combines various data types such as text, images, and audio to create more accurate and comprehensive models. However, large multimodal models (LMMs), like LLaVA, have been facing problems dealing with high-resolution graphics due to their inflexible and inefficient nature. Many have recognized the necessity for methods that may adjust the number of…

LLM360 presents K2: An entirely replicate-able, open-source Large Language Model that outperforms Llama 2 70B while using 35% less computational energy.

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Staff, Tech News, Technology, UncategorizedJune 2, 2024184Views 0Likes 0Comments

K2 is an advanced large language model (LLM) by LLM360, produced in partnership with MBZUAI and Petuum. This model, dubbed K2-65B, comprises 65 billion parameters and is completely reproducible, meaning that all components, including the code, data, model checkpoints, and intermediate results, are open-source and available to anyone. The main aim of this level of…

RobustRAG: An Exclusive Protective Structure Designed to Counteract Retrieval Pollution Attacks within Retrieval-Augmented Generation (RAG) Systems.

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Machine learning, Staff, Tech News, Technology, UncategorizedJune 2, 2024231Views 0Likes 0Comments

Retrieval-augmented generation (RAG) has been used to enhance the capabilities of large language models (LLMs) by incorporating external knowledge. However, RAG is susceptible to retrieval corruption, a type of attack in which disruptive information is inserted into the document collection, leading to the generation of incorrect or misleading responses. This poses a serious threat to…

This small, secure identification label has the ability to verify almost anything.

Artificial Intelligence, Computer chips, Computer science and technology, Electrical Engineering & Computer Science (eecs), Electronics, Internet of things, Machine learning, MIT Schwarzman College of Computing, National Science Foundation (NSF), Research, Research Laboratory of Electronics, School of Engineering, Sensors, Supply chains, UncategorizedJune 2, 2024213Views 0Likes 0Comments

The ability to confirm the authenticity of products has become a paramount need in our world today, especially with the rise of counterfeiting. The most common method often used is radio frequency tags or RFIDs, which confirms the authenticity of a product but at a size and cost disadvantage. However, a new research by the…

The new system recognizes medications that should not be combined.

Artificial Intelligence, Computer science and technology, Drug delivery, Machine learning, Mechanical engineering, Medicine, National Institutes of Health (NIH), Research, School of Engineering, UncategorizedJune 2, 2024207Views 0Likes 0Comments

Oral medications must traverse the lining of the digestive tract through a process facilitated by proteins found in the cells lining the gastrointestinal tract. Researchers at MIT, Duke University, and Brigham and Women's Hospital have developed a new strategy to identify these proteins (transporters) utilized by individual drugs. This knowledge could enhance patient treatment, as…

Empowering individuals who have issues to resolve with the use of Artificial Intelligence.

Algorithms, Alumni/ae, Artificial Intelligence, Data, Innovation and Entrepreneurship (I&E), Machine learning, Media Lab, School of Architecture and Planning, Startups, UncategorizedJune 2, 2024217Views 0Likes 0Comments

In 2010, Karthik Dinakar SM ’12, PhD ’17 and Birago Jones SM ’12, Media Lab students at MIT, collaborated on a class project to design a practical tool for content moderation at companies such as Twitter and YouTube. This groundbreaking project garnered substantial excitement, leading to an invitation to present a demonstration at a cyberbullying…

Llama3-V: A Leading Edge Open-Source Very Large Model (VLM) with Performance on Par with GPT4-V, Gemini Ultra, Claude Opus but with a Model 100 Times Smaller.

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Machine learning, Staff, Tech News, Technology, UncategorizedJune 1, 2024237Views 0Likes 0Comments

Transitioning from Explicit to Implicit: Gradual Integration Catalyzes the Advent of a New Age in Reasoning for Natural Language Processing

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Staff, Tech News, Technology, UncategorizedJune 1, 2024213Views 0Likes 0Comments

Natural Language Processing (NLP) enables computers to understand, interpret, and generate human language. However, enhancing their ability to solve complex reasoning tasks that require logical steps and coherent thought processes is challenging, particularly as most current models rely on generating explicit intermediate steps which are computationally expensive. Several existing methods attempt to address these challenges. Explicit…

Tackling Bootlicking in AI: Difficulties and Findings from Human Input Training

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Staff, Tech News, Technology, UncategorizedJune 1, 2024224Views 0Likes 0Comments

Researchers from the University of Oxford and the University of Sussex have found that human feedback, used to fine-tune AI assistants, can often result in sycophancy, causing the AI to provide responses that align more with user beliefs than with the truth. The study revealed that five leading AI assistants consistently exhibited sycophantic tendencies across…

MoEUT: A Durable Machine Learning Method to Tackle Efficiency Issues in Universal Transformers

AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Machine learning, Staff, Tech News, Technology, UncategorizedJune 1, 2024215Views 0Likes 0Comments

Universal Transformers (UTs) are key in machine learning applications such as language models and image processors, but they suffer from efficiency issues. Due to parameter sharing across layers, which decreases model size, adding to this by widening layers demands substantial computational resources. Consequently, UTs are not ideal for tasks which require heavy parameters, such as…

LlamaFS: A Publicly Available Autonomous File System Utilizing Llama-3

AI Shorts, Applications, Artificial Intelligence, Editors Pick, Staff, Technology, UncategorizedJune 1, 2024224Views 0Likes 0Comments

The recently released open-source project, LlamaFS, is designed to tackle the complex issues inherent in traditional file management systems, notably in handling overflowing download folders, inefficient file organization, and the constraints of knowledge-based organization. These problems often stem from the manual nature of file-sorting which can result in inconsistent structures and difficulties in locating specific…

All Categories

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

Electrical Engineering & Computer Science (eecs)(430)

Machine learning(1188)

News(748)

Research(613)

School of Engineering(648)

All Categories

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

Electrical Engineering & Computer Science (eecs)(430)

Machine learning(1188)

News(748)

Research(613)

School of Engineering(648)

All Categories

Artificial Intelligence(2794)

Computer science and technology(559)

Data(164)

Electrical Engineering & Computer Science (eecs)(430)

Machine learning(1188)

News(748)

Research(613)

School of Engineering(648)

All
Categories

All
Categories

All
Categories