As AI systems continue to advance, researchers and policymakers are concerned about ensuring their safe and ethical use. The main issues center around the potential risks posed by ever-evolving and increasingly powerful AI systems. These risks involve potential misuse, ethical issues, and unexpected consequences stemming from AI's expanding abilities. Several strategies are being explored by…
Automated Machine Learning (AutoML) has become crucial for data-driven decision-making, enabling experts to utilize machine learning without needing extensive statistical knowledge. However, a key challenge faced by current AutoML systems is the efficient and correct handling of multimodal data, which can consume significant resources.
Addressing this issue, scientists from the Eindhoven University of Technology have put…
The existing Artificial Intelligence (AI) task management methods, including AutoGPT, BabyAGI, and LangChain, often rely on free-text outputs, which can be lengthy and inefficient. These frameworks commonly struggle with keeping context and managing the extensive action space linked with arbitrary tasks. This report focuses on the inefficiencies of these current agentic frameworks, particularly in handling…
Reinforcement Learning from Human Feedback (RLHF) plays a pivotal role in ensuring the quality and safety of Large Language Models (LLMs), such as Gemini and GPT-4. However, RLHF poses significant challenges, including the risk of forgetting pre-trained knowledge and reward hacking. Existing practices to improve text quality involve choosing the best output from N-generated possibilities,…
The United Nations' Sustainable Development Goals (SDGs) aim to eradicate poverty, protect the environment, combat climate change, and bolster global peace and prosperity by the year 2030. Despite extensive research, additional work is required to accurately forecast SDG scores, which measure progress towards these objectives. By employing ARIMAX and Linear Regression machine learning models which…
Large Language Models (LLMs) have improved significantly, but challenges persist, particularly in the prefilling stage. This is because the cost of computing attention increases with the number of tokens in the prompts, leading to a slow time-to-first-token (TTFT). As such, optimizing TTFT is crucial for efficient LLM inference.
Various methods have been proposed to improve…
Multi-target multi-camera tracking (MTMCT) has become indispensable in intelligent transportation systems, yet real-world applications are complex due to a shortage of publicly available data and laborious manual annotation. MTMCT involves tracking vehicles across multiple camera lenses, detecting objects, carrying out multi-object tracking, and finally clustering trajectories to generate a comprehensive image of vehicle movement. MTMCT…
In the domain of visual question answering (VQA), the Multi-Image Visual Question Answering (MIQA) remains a major hurdle. It entails generating pertinent and grounded responses to natural language prompts founded on a vast assortment of images. While large multimodal models (LMMs) have proven competent in single-image VQA, they falter when dealing with queries involving an…