
Streamlining Data Handling in PyTorch: Building an Efficient Data Pipeline
Introduction In the realm of machine learning, managing large datasets efficiently is often a critical task. PyTorch, known for its flexibility and ease of use,

Introduction In the realm of machine learning, managing large datasets efficiently is often a critical task. PyTorch, known for its flexibility and ease of use,

The dataset is a rich compilation of recipes, spanning a wide range of cuisines and styles. It offers a unique perspective on what makes a recipe more than just a list of ingredients and steps. With over 500k recipes, it’s a deep dive into the culinary world, providing data enthusiasts, chefs, and food bloggers an opportunity to analyze and understand cooking trends on a macro scale.

Voice conversion with a Keras autoencoder model

The famous QR factorization algorithm can be BLAS-3 optimized. Using the Woodbury matrix identity, we implement a block-based Rank-Revealing QR and point out a connection between pivot selection and object detection.

You want to identify the breaker switch for a particular outlet. This problem inspired me to write an article on Medium one day as I

Many strategies in Machine Learning involve the iterative search of a solution space Ω. We begin with an initial solution ω₀ and update ωₙ to minimize an objective function f(x)

There are k! substitution ciphers for an alphabet with k letters—too many for an exhaustive search. With a frequency-based approach adapted to the graph of

In Spring of 2019 my Environmental Fieldwork class surveilled the herbaceous plants growing on and around the Tufts campus, recording their identities and locations into

Vector search using Celery and LangChain where OpenAI embedding clients live inside Celery workers. Startup signals and lazy resource loading.

A look at adding vector search to a Django service with pgVector and OpenAI embeddings, how Celery keeps embedding workloads scalable.