RAG vs. REFRAG: Process Flows and Differences

RAG Retrieval-Augmented Generation (RAG) is nowadays a common method to connect Large Language Models with external knowledge. Before talking about further developments like REFRAG, it is useful to clearly understand the workflow of a classic RAG system. The diagram below shows the standard process. The steps are color-coded, numbered and can be well divided into four areas: RAG Process Description Steps 1 - 4: Preprocessing (data preparation) Steps 5 - 7: Embedding ...

November 19, 2025 · Aaron

Install Docker Engine on Debian 12 Bookworm

This guide describes how to install the Docker Engine on Debian 12 (Bookworm) and configure it so that it starts automatically as a system service and remains updatable via APT. System update Before installation, a complete update of all existing packages is recommended to avoid version conflicts. Required tools ca-certificates provides the certificates of trusted issuers. curl is used to download the Docker GPG key. gnupg converts the key into a format readable by APT. ...

November 1, 2025 · Aaron

My Musical Experiment: An AI-Generated Salsa Album

Normally I write about technology, but today it’s about an experiment at the intersection of technology and creativity. I created a complete salsa album using various AI tools. The goal was to see how an AI processes the rhythmic and cultural nuances of salsa. The result is now available on all major streaming platforms, including Spotify, Apple Music and Amazon Music. Spotify https://open.spotify.com/album/2dX9EB9tMiS3AB8m9HBydR Apple Music https://music.apple.com/de/album/la-vida-en-clave/1840664275 Amazon Music https://www.amazon.de/dp/B0FR1J34QR/ Enjoy listening! https://www.thawalib-parabek.sch.id/wp-content/uploads/2019/06/pahami-pola-rahasia-ini-banyak-pemain-dapat-maxwin-lebih-cepat-tanpa-modal-besar.html https://www.thawalib-parabek.sch.id/wp-content/uploads/2019/06/cara-membaca-pergerakan-pola-sebelum-spin-ini-yang-jarang-dipelajari-pemain-pemula.html ...

October 15, 2025 · Aaron

AI-based Predictive Maintenance

The following podcast was generated by AI. Introduction In modern industrial manufacturing, predictive maintenance, or Predictive Maintenance (PM), has become a decisive competitive factor. Thanks to rapid advances in Artificial Intelligence (AI), companies can not only monitor the condition of their equipment but also precisely predict its future failure behavior. This technological revolution is built on three inseparable pillars. Firstly, the data serves as the raw material. In the next step, it is transformed into valuable insights through anomaly detection and model training. Finally, AI uses these insights to make reliable predictions. Understanding these interrelationships is the first and most important step on the path to intelligent maintenance. ...

September 22, 2025 · Aaron

Telegram Bot API Tutorial

Telegram is more than just a messaging app. It has evolved into a versatile platform for automation and the development of interactive services. Especially in the field of AI applications and tools like n8n, Telegram is becoming increasingly popular because it enables direct bidirectional communication with people. For developers who want to leverage these possibilities, there are two fundamental approaches: the Bot API and the MTProto-API. The Bot API: The standard way for most bots ...

September 15, 2025 · Aaron

Web Application Firewall (WAF): A Practical Guide

This guide covers the function, components and practical implementation of a Web Application Firewall (WAF). Definition of a Web Application Firewall A Web Application Firewall (WAF) is a security system for monitoring, filtering and blocking HTTP traffic to and from a web application. Unlike a traditional network firewall, which operates at the network and transport layers (Layer 3 and 4) of the OSI model and filters traffic primarily based on IP addresses and ports, a WAF operates at the application layer (Layer 7). ...

September 8, 2025 · Aaron

AsciiDoc & Kroki: a proven combination for clear software documentation

Software documentation today is more than just a mandatory task. It is part of the development process and must be as agile, versionable and automatable as source code itself. Classic office documents or proprietary wikis quickly reach their limits. Changes are hard to trace, collaboration is cumbersome and integration into CI/CD pipelines is hardly possible. AsciiDoc and Kroki address exactly these points. Both tools embody the Docs-as-Code approach, where documentation is treated like source code. AsciiDoc provides an expressive text-based language for technical documentation that can be managed in Git and processed automatically. Kroki complements this with the ability to embed diagrams directly in the document and render them automatically. This allows developers to use diagrams without needing to install additional tools. ...

September 6, 2025 · Aaron

Data sovereignty as the key to successful AI

Introduction: Why control over data determines AI success Artificial intelligence does not emerge in a vacuum, it relies on data as fuel. A neural network without a broad and high-quality data foundation can neither process language nor identify objects or derive meaningful recommendations. For companies, this results in a clear consequence: whoever loses control over their own data leaves the crude oil of the digital economy to external platforms. The concept of data sovereignty describes exactly this factual and legal dominion over one’s own data holdings and goes far beyond classical data protection. While data protection primarily aims to safeguard individuals’ fundamental rights by protecting their information, data sovereignty represents a strategic and economic question. It is about who owns data and who is allowed to use it in which way. This determines whether data becomes a valuable competitive advantage or flows unnoticed into the value chains of others. ...

September 3, 2025 · Aaron

Machine Learning Methods Mindmap

In machine learning there are many paradigms and methods from supervised and unsupervised learning through semi and self supervised to transfer learning and reinforcement learning. To keep an overview I have created a mindmap as a memory aid. On the first level are the learning types such as supervised learning, then the task areas, followed by the thematic subgroups and finally concrete methods. The visualization makes clear how broad the field is and facilitates the classification of the individual methods. It shows only a subset of the possibilities. Many other methods exist, I have focused here on the most frequently used and established in practice methods. ...

August 31, 2025 · Aaron

Part 2: Strategies for Better Results with RAG

In Part 1 we saw how crucial clean document preparation and thoughtful chunking are to the quality of Retrieval Augmented Generation. These basics form the starting point for a whole range of further optimizations that shape the entire process. In Part 2 we continue the series and focus on the next building blocks that build on this foundation and further develop the use of RAG in the enterprise. Embedding Domain-specific Embeddings Domain-specific embeddings mean that vector representations of texts are not generated with generally trained embedding models, but with models adapted to the technical language and content of a specific industry or company. General models are trained on very large, unspecific text corpora, including books, websites, Wikipedia and other sources. They understand everyday language and many standard concepts, but often miss the nuances in, for example, legal contracts, technical manuals or medical reports. Domain-specific embeddings are created either by fine-tuning an existing model with data from the respective domain or by training a custom model on a corpus of internal documents, guidelines, protocols and manuals. ...

August 28, 2025 · Aaron