Skip to content Skip to sidebar Skip to footer

Data Science

The launch of sqlite-vec v0.1.0 has been announced. This movable vector database extension is compatible with SQLite, with the capacity to support a million 128-dimension vectors. It also supports binary quantization and includes an expansive selection of SDKs.

Alex Garcia recently released sqlite-vec v0.1.0, a SQLite extension written in C that brings powerful vector search capability to the SQLite database system. Available under the MIT/Apache-2.0 dual license, the extension pairs versatility with accessibility, making it a highly valuable tool for developers across different platforms and environments. The new sqlite-vec extension enables vector search functionality…

Read More

Stumpy: An Efficient and Extensible Python Tool for Contemporary Time Series Analysis

Time series data, used across sectors including finance, healthcare, and sensor networks, is of fundamental importance for tasks including anomaly detection, pattern discovery, and time series classification, informing crucial decision-making and risk management processes. Extracting useful trends and anomalies from this extensive data can be complex and often requires an immense amount of computational resources.…

Read More

Enhancing your RFM Model in BigQuery: A Guide

This article discusses the enhancement of an RFM (Recency, Frequency, Monetary) model in BigQuery for improved customer insights and relationship management. The RFM model is a straightforward and easily implementable tool that provides valuable insights into customer behavior by sorting customers into distinct groups such as Champions, Potential Loyalists, and those at risk of being…

Read More

Developing Artificial User Studies: Stimulating Personas & Independent Entities

The article outlines the process of creating synthetic user research using Autogen, an autonomous agent orchestration tool. The application of the Large Language Model (LLM) from OpenAI was explored with versions GPT-3.5 and GPT-4. The whole process starts with setting up the environment and creating the Autogen configuration, LLM, and API keys. The LLM instance has…

Read More

System Architecture: Bloom Filter. Ingeniously reconfiguring a hash table… | authored by Vyacheslav Efimov | March, 2024

A hash table is a foundational data structure known for its optimal performance for insertion, search, and deletion queries given a well-chosen hash function. However, hash tables can encounter issues such as potential collisions, which can slow down processes and require increased memory space to mitigate. A probabilistic data structure known as a Bloom filter can…

Read More

Four Impressive Data Engineering Projects That Enhance Your Resume | by 💡Mike Shakhomirov | March, 2024

Data engineering is a highly coveted field that demands a robust set of skills and knowledge. The perfect blend of software engineering, data analysis, and data platform architecture, it's a career path that can be complicated but equally fulfilling. In the following section, the ideas for four data engineering projects that enhance any CV and…

Read More

Comparing the Syntax of Polars and SQL: An In-depth Examination of a Polars Query by Ben Feifke, March 2024.

The blog post "Anatomy of a Polars Query: A Syntax Comparison of Polars vs SQL” by Ben Feifke discusses the transition from using the Pandas software library to Polars, a more recent addition to the data analysis field. Despite being marketed for use as a replacement for Pandas, Polars operates differently from its predecessor, resulting…

Read More

DIGITOUR: Automated Virtual Showcases for Property Real-Estate 🏠 | Authored by Prateek Chhikara | March, 2024

The DIGITOUR system is an end-to-end pipeline for creating digital tours of real-estate properties. It involves capturing 360-degree images in each area of a property, tagging each of these areas with bi-colored paper tags, and using machine learning algorithms to stitch together a coherent tour. To create a tour, an operator places paper tags at various…

Read More

What is Required to Pursue a Career in Data Engineering in 2024? | Authored by 💡Mike Shakhomirov | March, 2024

If you are considering a career move, particularly towards software engineering and database design, a prospect that might appeal to you is the growing field of data engineering. Whether your background lies in marketing, analytics, or finance, you can successfully transition into the data space and be part of an industry full of opportunities and…

Read More