The privacy of users participating in online communities is an imperative issue. Websites like Reddit allow users to post under pseudonyms to maintain anonymity; however, anonymity can lead to abusive behavior. In some instances, pseudonyms may not entirely assure privacy as a user's writing style can disclose their identity. These identifiable elements within a text,…
Generative AI models such as Large Language Models (LLMs) have proliferated over various industries, advancing the future of programming. Historically, the field of programming has been primarily governed by symbolic coding that unites traditional symbolic code and neural networks to solve specific tasks. Symbolic programming's backlash, however is that it often requires developers to manually…
Generative Artificial Intelligence (Gen AI) is leading to significant advancements in sectors such as science, economy, and education. At the same time, it also raises significant concerns that stem from its potential to produce robust content based on input. These advancements are leading to in-depth socio-technical studies to understand the profound implications and assessing risks…
The increasing demand for financial data analysis and management has propelled the expansion of question-answering (QA) systems powered by artificial intelligence (AI). These systems improve customer service, aid in risk management, and provide personalized stock recommendations, thus requiring a comprehensive understanding of financial data. This data's complexity, domain-specific terminology, market instability, and decision-making processes make…
The rapid growth of digital text in different languages and scripts presents significant challenges for natural language processing (NLP), particularly with transliterated data where performance often degrades. Current methods, such as pre-trained models like XLM-R and Glot500, are capable of handling text in original scripts but struggle with transliterated versions. This not only impacts their…
Brain-computer interfaces (BCIs), which enable direct communication between the brain and external devices, have significant potential in various sectors, including medical, entertainment, and communication. Decoding complex auditory data like music from non-invasive brain signals presents notable challenges, mostly due to the intricate nature of music and the requirement of advanced modeling techniques for accurate reconstruction…
The Python Testbed for Federated Learning Algorithms (PTB-FLA) is a low-code framework developed for the TaRDIS project of the EU Horizon 2020. With the intent to streamline the development of decentralized and distributed applications for edge systems, it is constructed in pure Python, allowing it to be lightweight and easily installed, specifically fitting for small…
Google AI researchers are working towards generating high-quality synthetic datasets while ensuring user privacy. The increasing reliance on large datasets for machine learning (ML) makes it essential to safeguard individuals' data. To resolve this, they use differentially private synthetic data, new datasets that are completely artificial yet embody key features of the original data.
Existing privacy-preserving…