Tech and related writings
A future beyond discrete, turn-based LLM conversations.
A look at how LLMs learn, and what this says about their limitations.
Late-chunking might be the closest we've ever been to solving retrieval with text embeddings models. This post gets it working on very long documents.
This blog offers an initial, evolving taxonomy for LLM agents.
Fine-tuning a BERT embeddings model with QLoRA using unsloth and Sentence Transformers.