Recent Favourite Reads - Newsletter 16.07.2023-11.09.2023

Main

  1. Curbing Connection Churn in Zuul

    10 min

    ;
  2. Interesting patterns to consume ZIO Streams

    6 min

    ;
  3. Weird Python "Features" That Might Catch You By Surprise

    5 min

    ;
  4. Things DBs Don't Do - But Should

    12 min

    ;

    Even though I completely agree with most of the points in the article, I would like to add a little context and address some of the wishes. To begin with, from time immemorial (since Oracle 8i), Oracle has Row Level Security with later applications in the form of Virtual Private Database and Real Application Security. All these technologies bring tenant awareness up to the next level and simplify the development process in the pooled model - no more missing "where" clauses.
    Another point is that a browser can talk to an Oracle database via HTTP-based protocol: SODA, REST-Enabled SQL and other techs are provided by a database. Edition-based redefinition allows engineers to have multiple versions of packages, aliases (and some other objects), which doesn't solve version control issues completely but simplifies it much. Oracle Scheduler supports job scheduling and even more advanced things such as job chains and prioritising. And so on. Many of the identified issues are partially or entirely resolved in the Oracle database ecosystem, leading to the whole thing turning into an incredible monster with a rather sharp learning curve and an inhumane price tag. It seems to me that the article contains various shortcomings of databases, which should rather be addressed in the field of standardising approaches to data platforms. There are already dozens of solutions for almost every point, and the problem is their fragmentation, lack of composition and widespread adoption. An additional risk is that all this leads to a dangerous temptation to start developing business logic. If everything you need is available out of the box, why complicate it? And I'm not against stored procedures - they can indeed save the day in some cases, but a separation of concerns is necessary for a variety of reasons.

  5. How I joined the bug 323 community

    10 min

    ;
  6. Why Does gRPC Insist on Trailers?

    7 min

    ;
  7. The road to running Apache Flink applications on AWS KDA

    8 min

    ;
  8. Data Streaming: Real-time data for real-time decisions (Palantir RFx Blog Series, #8)

    11 min

    ;
  9. Historic Algorithms Help Unlock Shortest-Path Problem Breakthrough

    8 min

    and paper mentioned: Negative-Weight Single-Source Shortest Paths in Near-linear Time and Maximum Flow and Minimum-Cost Flow in Almost-Linear Time;
  10. Anti-hype LLM reading list;
  11. Java 21: So How Should We Construct Strings Now?

    6 min

    ;
  12. Apache SeaTunnel and Paimon: Unleashing the Potential of Real-Time Data Warehousing

    18 min

    ;
  13. Leaving Haskell behind

    14 min

    ;
  14. Where’s My Data — A Unique Encounter with Flink Streaming’s Kinesis Connector

    12 min

    ;
  15. Technology Lifecycle

    10 min

    ;
  16. Scaling the Instagram Explore recommendations system

    10 min

    ;
  17. How Uber Optimized Cassandra Operations At Scale

    10 min

    ;
  18. A Flexible Framework for Effective Pair Programming

    11 min

    ;
  19. Metis: Building Airbnb’s Next Generation Data Management Platform

    8 min

    ;
  20. Two Erlang patterns I love

    8 min

    ;
  21. Debugging Crashes and Deadlocks in Python using PyStack

    7 min

    ;
  22. Software Engineering Culture Metrics

    8 min

    ;
  23. Microservices Retrospective – What We Learned (and Didn’t Learn) from Netflix

    25 min

    ;
  24. Applying SRE Principles to CI/CD

    6 min

    ;
  25. Under the hood: How we built API versioning for LinkedIn Marketing APIs

    6 min

    ;
  26. Analyzing Time Series for Pinterest Observability

    8 min

    ;
  27. Why You (Probably) Don’t Need to Fine-tune an LLM

    10 min

    ;
  28. Challenging algorithms and data structures every programmer should try

    4 min

    I'm not a big fan of catchy headlines like "N things every X should know/try", but this article is concise and inviting for discussion. I recommend following the comments here and cannot help but agree. Even if you're not going to implement all of these on your own, knowing that in advance could save you a considerable chunk of time just because you can recognise that you're facing a well-known problem with a dedicated algorithm or approach to solve. I have used topological sort in various applications, from validation complex/combined therapies in biotech context up to resolving dependencies graph for calculation expressions; Myers diff algorithm for comparison and automatic merge of annotated texts; simplex method for building field engineer schedules; binary expression tree for a small DSL for describing treatments; interval trees in balance management subsystem, annotation spans merging and so on. I can confidently say that all this significantly saved time and eliminated the need to come up with non-optimal and reinventing-the-wheel solutions on the fly without any sensible reason.

  29. Scala cake pattern

    5 min

    with a following criticism

    4 min

    ;
  30. Yes, you should test on production…

    12 min

    ;
  31. The EU’s Digital Services Act goes into effect today: here’s what that means

    4 min

    ;
  32. Rules of Thumb for Software Development Estimations

    15 min

    ;
  33. Farewell EC2-Classic, it’s been swell

    4 min

    ;
  34. Does your organization have a culture of mediocrity?

    8 min

    ;
  35. HashiCorp adopts Business Source License

    2 min

    with Open Source answer The OpenTF Manifesto and its latest announce on Terraform fork. Fork repo is available here;
  36. Zero Configuration Service Mesh with On-Demand Cluster Discovery

    8 min

    ;
  37. static final what?

    4 min

    ;
  38. The Dark Side of Sharding: Mistakes You Can’t Afford to Make

    12 min

    ;
  39. My mental model of transducers

    4 min

    ;
  40. Optimizing Postgres's Autovacuum for High-Churn Tables

    8 min

    ;
  41. Profile-guided optimization in Go 1.21

    12 min

    ;
  42. Why we always end up with waterfall

    3 min

    with comments;
  43. My Solution To The Dev Skills Crisis: Much Smaller Teams

    4 min

    ;
  44. AI, ML, and Data Engineering InfoQ Trends Report - September 2023

    7 min

    ;
  45. Exponential Value at Linear Cost

    4 min

    ;
  46. Highlights from Git 2.42

    12 min

    ;
  47. JEP 447: Statements before super() (Preview)

    12 min

    ;
  48. JEP draft: Computed Constants

    8 min

    ;
  49. JDK 21 G1/Parallel/Serial GC changes

    4 min

    ;
  50. This is the Beginning of the End of the N+1 Problem: Introducing Single Query Loading

    7 min

    ;
  51. (JVMLS) The Challenges of Introducing Virtual Threads to the Java Platform - Project Loom

    45 min

    ;
  52. (JVMLS) Continuations - Under the Covers

    35 min

    ;
  53. (JVMLS) Value Objects in Valhalla

    50 min

    ;

Misc

  1. Societies of strangers do not speak less complex languages

    30 min

    ;
  2. Engineering Serendipity: When Does Knowledge Sharing Lead to Knowledge Production?

    40 min

    ;
  3. Positive Association between Altitude and Suicide in 2584 U.S. Counties

    22 min

    ;
  4. Ask vs guess culture

    8 min

    ;
  5. Piranesi, Borges, and the Labyrinths of Time

    7 min

    ;
  6. The Psychological Depths of Rock-Paper-Scissors

    4 min

    ;
  7. How did "oxen" (plural of "ox") survive as the only plural form with the Old English plural ending -en?

    7 min

    ;
  8. Executive Function Theft

    12 min

    ;
  9. When tech says ‘no’

    6 min

    ;

Interesting projects

  1. Interactive Runbooks Built with Markdown;
  2. jacobin: A more than minimal JVM written in Go and capable of running Java 17 classes;
  3. micro: A modern and intuitive terminal-based text editor;
  4. Capacities: A studio for your mind;
  5. A collection of debugging stories;
  6. Async-Await support for Java

    Definitely not a production-ready code, but it's a nice demo of how concise async/await (and, in fact, emulation of coroutines) implementation on the top virtual threads could be. Based on that project PoC of async/await in Vert.x was created.

  7. JSON Schema Store;
  8. tRPC: Move fast and break nothing. End-to-end typesafe APIs made easy.;
  9. Legal Lullabies: Lull yourself to sleep with the soothing white noise of your favorite tech giant’s terms of service;
  10. Grammar generator app;
  11. Jujutsu: A Git-compatible DVCS that is both simple and powerful with slides;
  12. Scan of month: Coffee

    It's just so fun. I am not sure about the practicality of applying computed tomography to every other surrounding object, but it is so curious and even aesthetically pleasing that I can only support the project with both hands.

  13. little-rat: Small chrome extension to monitor (and optionally block) other extensions' network calls;
  14. EventCatalog: Discover, Explore and Document your Event Driven Architectures powered by Markdown;
  15. Tuple: The best remote pair programming app on macOS and Linux;
  16. RFC 3339 vs ISO 8601;
  17. DevTools Tips;
  18. Climate Change Tracker;
  19. School of SRE: curriculum for onboarding entry-level talents into the SRE role;
  20. Alexandria: A minimalistic cross-platform eBook reader;

Books

  1. The Architecture of Open Source Applications

    It is a fairly fundamental series of books that helps better understand how some large, complex or well-known open-source projects were designed and developed and the rationale behind these decisions. Some of the chapters are posted on the site to catch your interest. Definitely could recommend it as a learning material, evening reading for curious minds or an endless source of topics for a software engineering book club.