Posts on Tom Phillips

Understanding Kotlin DSLs

Tue, 04 Mar 2025 10:02:35 +0000

I’m learning Kotlin for a new job and I was confused by type-safe builders. They’re used to create domain specific languages (DSLs) in Kotlin. Two common use cases are configuring routes in Ktor server or building HTML. I think they’re confusing because they combine a bunch of Kotlin concepts. Here’s how I broke it down to understand it.

Let’s start with a data class to describe a pet. Note the member properties are mutable and have default values. I’ll come back to why at the end.

data class Pet(var name: String? = null, var description: String? = null)

Next we need a builder function. It takes one argument, init, which is a function type with receiver. In other words, init takes no arguments, and operates in the context of a Pet instance, and returns Unit. The builder function returns an instance of Pet.

fun pet(init: Pet.() -> Unit): Pet {  
    val pet = Pet()  
    pet.init()  
    return pet  
}

You use it to create an instance of Pet like this:

var jessica = pet {  
    name = "Jessica"  
    description = "Black and white cat"  
}

There are two confusing things going on here.

First, there’s no () calling the pet function. Instead, there’s a trailing lambda. In Kotlin, when the last argument to a function is a function, then you can omit the parentheses and pass the function as a lambda expression in curly brackets.

Second, how do the assignments to name and description end up as member properties of jessica? Previously I said init is a function type with receiver, and inside these types of function, the receiving object can be accessed via this. But where’s the this? You don’t need it because Kotlin infers that name is actually this.name.

We can then nest another class inside Pet. Let’s add information about the Owner of the pet.

data class Owner(var name: String? = null)  
  
data class Pet(var name: String? = null, var description: String? = null, val owners: MutableList<Owner> = mutableListOf<Owner>()) {  
    fun owner(init: Owner.() -> Unit): Owner {  
        val owner = Owner()  
        owner.init()  
        owners.add(owner)  
        return owner  
    }  
}

We’ve added a member property called owners and a member function called owner. This is another builder, like pet, that creates an instance of Owner, calls init, then adds the owner to the pet’s list of owners, and returns the owner.

Put this altogether to get our DSL:

var jessica = pet {  
    name = "Jessica"  
    description = "Black and white moggy"  
    owner {  
        name = "Tom"  
    }  
    owner {  
        name = "Kelly"  
    }  
}

// println(jessica) returns:
// Pet(name=Jessica, description=Black and white moggy, owners=[Owner(name=Tom), Owner(name=Kelly)])

When I defined Pet I noted that the member properties must be mutable and have default values. You need defaults because the builder functions create an instance without passing any arguments to the constructor. Then because init sets the member properties, they must be mutable too. The exception is Pet.owners, because we don’t want to reassign the (mutable) list reference, just modify the contents of it.

Here’s the code in Kotlin Playground.

Understanding Kotlin DSLs by Tom Phillips is licensed under CC BY 4.0

Leaving iPhone after 16 years for Android

Sun, 02 Mar 2025 00:00:00 +0000

For 16 years I’ve used an iPhone. I’ve owned maybe 4-5 models and my current one is an iPhone 11. The battery is knackered and I started to think what I’d do if I needed to replace it. iPhones are getting bigger and more expensive. The cheapest model, the iPhone 16e, starts at £599. I don’t want to spend that much money on a phone.

Lately I’ve become uncomfortable with Apple’s control over what I can do on my phone. If it’s not on the App Store, then I can’t run it, despite having bought the hardware. I used to find genuinely useful apps, but I can’t remember the last time I downloaded something like that. Nowadays it’s full of low-quality apps designed to extract micro-transactions from users, including children. Apple makes a ton of money from their commission.

Apple has the ability to remove apps from the App Store, including ones already purchased and downloaded, so oppressive regimes can force Apple to remove apps that they don’t like. In the UK, Apple has disabled end-to-end encryption for iCloud data, so the government – or whoever manages to compromise Apple’s key management – can access the data of users in iCloud, including all device backups. This is hugely invasive. I can’t even switch to another provider because Apple locks iOS down.

Let’s not forget Apple Intelligence: I don’t want generative AI crap on my phone or computer.

Overall, I don’t trust American tech companies to do the right thing anymore. They operate in an authoritarian regime. I can only see the situation getting worse.

So a week ago I switched to a Pixel 8a running GrapheneOS.

GrapheneOS is a privacy and security focussed mobile operating system based on the Android Open Source Project. One of its features is sandboxed Google Play, so you can download things like banking apps that don’t work or are unreliable on other Android OS like /e/os and LineageOS.

I chose a Pixel 8a because it is a device recommended by GrapheneOS and comes with 7 years of support from launch (May 2024). Wirecutter also recommends it as their budget pick. It’s also well-priced. I paid £349. I’d have preferred a Fairphone over a Google device, but they’re more expensive and unsupported by GrapheneOS.

Installing GrapheneOS via WebUSB was easy (and really cool). Overall, I’ve found it intuitive and easy to use. Perhaps it’s slightly less polished than iOS, but it’s fine.

It was easy to find replacement for apps. I already use Fastmail, so I downloaded their app for my email, calendars and contacts. I downloaded DAVx5 to sync my Fastmail contacts to the Contacts app. For now I have some calendars shared with my partner on iCloud. I’m accessing those via Fastmail too.

I simply stopped using iMessage. Everyone uses WhatsApp or, better, Signal anyway.

I replaced iCloud Photos with Ente. It uses end-to-end encryption. I’ve not uploaded my whole library yet, but it’s working fine so far.

I swapped Overcast for AntennaPod. I have noticed the absence of volume normalization, but I can live without it.

I’ve used Things to manage todo lists since ~2008. But really I have simple needs. So I’m trying out a plaintext todo list in Obsidian, with end-to-end encrypted sync via Obsidian Sync, and simply putting date-sensitive items in my calendar.

The only niggle so far was some lag on the GrapheneOS camera app when pressing the shutter button. I downloaded the Google Camera app and disabled network access. It’s much faster.

I’ve not tried out device backups yet, but since my data is either in Fastmail, Ente, 1Password or Obsidian, I feel comfortable with the risk. I might try backing up to a USB stick.

I installed Obsidian and Tor via .apk files. You just download and install it! I got Organic Maps from the Accrescent store. I might try out Obtainium.

Overall, I’m really happy. It’s been much easier than I expected to switch. I’ve probably spent 3-4 hours on it in total. Go for it!

Leaving iPhone after 16 years for Android by Tom Phillips is licensed under CC BY 4.0

Learning Clojure: Vim REPL integration

Sat, 31 Aug 2024 21:13:29 +0100

In my last post I wrote that I was using the Cursive plugin for IntelliJ. It turned out that it doesn’t play nicely with the Ideavim plugin. I considered mapping keystrokes to Cursive actions, but that’s not an activity I enjoy and I doubted whether my keybindings would be any good.

Emacs is popular in the Clojure community, but I use Vim and I’m not willing to change editor to learn a new language. So I wanted a Vim plugin for REPL-driven development.

The hosts of the Clojure Design Podcast use Tim Pope’s vim-fireplace, so I tried it out. You start a REPL with lein repl, which writes the port to a .nrepl-port file. vim-fireplace reads it and connects to the REPL. It worked as the README described but the keybindings didn’t click for me.

After spotting the podcast was from 2019 and searching for something newer, I found Conjure. In fact one of the hosts blogged about switching to Conjure in 2023. Conjure is very interesting. It’s written in Fennel, a Lisp that compiles to Lua (so it’s for Neovim, not Vim). Conjure supports REPL-driven development for a bunch of different Lisps and somehow even Python, which I need to try.

Here I wasted loads of time reluctantly fiddling with Vim configs. I should definitely have tried Conjure out without installing. You should try this first!

Long story short: I didn’t want to move my Python development from PyCharm to Vim, so I settled on three vim config files: separate Ideavim and Neovim configs that both source a common config file of common preferences and keybindings. Ideavim supports a small number of plugins via vim-plug, so I figured it was easiest to use it in Neovim too.

Luckily, after all the fiddling, the Conjure keybindings made sense to me. Type <localleader>e to “evaluate” then pick the thing to evaluate with another key:

,eb – evaluate entire file
,ee – evaluate inner form under cursor
,er – evaluate outer most (root) form under cursor
,ew – evaluate word, e.g. to peek in a var

I like it!

Namespaces

Here’s something I found confusing.

I’ve got a project I started with Leinigen. I start up the REPL with lein repl and type (doc frequencies). It prints out the documentation for frequencies. Evaluating *ns* tells me I’m in the noughts-and-crosses.core namespace, but somehow Clojure still resolves doc from the clojure.repl namespace.

Now I open up src/noughts_and_crosses/model.clj in vim. At the start of the file the namespace is defined with (ns noughts-and-crosses.model). I type (doc frequencies) into the file then evaluate it (,ee) and Clojure complains it can’t resolve doc! Evaluating *ns* tells me I’m in the noughts-and-crosses.model namespace, so something is switching me into the file’s namespace, but isn’t loading doc from clojure.repl.

I couldn’t get to the bottom of why this happens. I suppose the Leinigen REPL loads a bunch of namespaces for convenience.

Learning Clojure: Vim REPL integration by Tom Phillips is licensed under CC BY 4.0

Learning Clojure: first steps

Wed, 28 Aug 2024 00:00:00 +0000

Why Clojure?

It’s been longer than I’d like since I learnt a new programming language. I mostly write Python at work and I wanted to learn something substantially different. I narrowed it down to Elixir or Clojure. I chose Clojure for a few reasons:

I’ve never learnt a Lisp before.
Clojure runs on the JVM, so you get Java interoperability. At work I’ve been thinking about Kafka, which has much better support for Java compared to Python, so this is appealing.
The author of Clojure, Rich Hickey, is hugely influential and I enjoy his talks. Simple Made Easy is a favourite of mine.
Elixir seems more focussed on highly concurrent applications, which I generally don’t write, so I thought Clojure will be more useful to me.
My manager sung Clojure’s praises.

Inspired by Julia Evans I decided to write down and share what I have learned and struggled with so far.

I like learning by reading. I dipped in and out of the Clojure getting started guide, Clojure for the Brave and True and Programming Clojure. I wasn’t keen on the style of “Brave Clojure”, but I did enjoy the intro to projects and namespaces. It introduces the project automation tool Leinigen, which seems a bit like Pipenv or Poetry in Python. When I get started with a new language I like to figure out early on the idiomatic way to organise projects, so with lein new app my-project I was sorted.

As a card-carrying member of the test-driven development (TDD) club I started trying to figure out how to write tests. I took a look at the tests in xtdb and clojure.test docs. I can’t remember how, but I discovered that most Clojure developers don’t do TDD like you would with pytest or go test, but they use “REPL-driven development”. I enjoyed Sean Corfield’s talk on the topic.

Sean used VSCode. I like PyCharm with IdeaVim, so I downloaded IntelliJ and the Cursive plugin for Clojure. Starting a REPL is somewhat buried but once you have it running you can evaluate the form under your cursor with cmd+shift+p.

I thought I had gone mad when my IntelliJ wouldn’t let me delete parentheses. It turned out something called paredit was enabled, which enables structural editing. Structural editing takes advantage of Clojure code itself being data (an idea I’m not convinced I understand on a deep level) and all the nested forms ((...), {...}, [...]) forming a tree. But being a beginner I turned it off so I could delete my unwanted bracket and be on my way. It might have been to do with Cursive and IdeaVim not working well together.

Turning off structural editing turned out to be a poor decision and something I’d switch back on, but that’s another post.

Learning Clojure: first steps by Tom Phillips is licensed under CC BY 4.0

What do you learn from your MVP?

Fri, 14 Jun 2024 00:00:00 +0000

Minimum viable product (MVP) has become meaningless business jargon.

The problem with so-called MVPs is that no one learns anything from them. There is no hypothesis under test.

If your MVP doesn’t unambiguously validate or invalidate a hypothesis, then it’s impossible to know whether you are succeeding or failing.

What do you know about your customers? What problems do they have? How do they get value from your product? Will the answers to these questions change if you build the MVP? If they don’t change, then whatever you’re building is a waste of time and money, and it’s not an MVP.

In The Lean Startup (first published in 2011!), Eric Ries writes that an MVP completes one cycle of the build, measure, learn loop.

If we’re honest, most faux MVPs are just collective agreement on what is a barely acceptable use of everyone’s time. Something to keep people busy. A minimum tolerable output.

So next time someone says “it’s an MVP”, ask “what are we learning from it?”.

What do you learn from your MVP? by Tom Phillips is licensed under CC BY-SA 4.0

Will dbt adopt a proprietary licence? I think so

Tue, 02 Apr 2024 00:00:00 +0000

Data build tool (dbt) changed the way many organisations transform data and do analytics. I think more change is afoot: I predict dbt Labs will adopt a proprietary licence for dbt Core and the project will fork.

dbt Labs use the loose open core business model, where the primary functionality in dbt Core is covered under an open source licence (Apache 2.0) with extra proprietary software wrapping around it, namely dbt Cloud.

The open core creates a large funnel of potential customers for the proprietary product. The difficulty with the open core business model lies in deciding whether a feature goes in the open core or proprietary product. The company wants to add functionality to the proprietary product to drive conversion of users to customers, but this often conflicts with the interests of the users of the open core. Despite this conflict of interest, the open core depends on the success of the the proprietary product to fund its development.

In dbt’s case, I think dbt Cloud isn’t a compelling product. Despite using dbt Core for years I have never bought dbt Cloud. I don’t know of any organisations who use it; I know of one who stopped.

A recent job advert says dbt Labs have over 4,100 dbt Cloud customers. At $100/month that’s ~$5M/year. Maybe they have some big enterprise deals, but that revenue is probably dwarfed by payroll. LinkedIn lists 421 members associated with dbt Labs. A conservative estimate of 300 employees at $100k/year is $30M/year, which would be a loss of ~$25M/year. Not great for an eight year old company valued at $4.2B

What can dbt Labs do? They already cut headcount by 15% and switched to consumption-based pricing. Can they ship some compelling features? I’m not convinced. Glassdoor reviews (taken with a large pinch of salt) complain about leadership and the product roadmap. In an interview last December, their CEO Tristan Handy said:

[W]hat I want to do is increasingly make living in the dbt ecosystem feel like living in the Apple ecosystem, which is to say that the hard stuff [like CI, observability, data cataloging] just kind of vanishes into the background. And you forget that that was ever a problem and you spend all your time thinking about business value.

This is a nice idea but it will be hard to execute. He acknowledged that comparing dbt Labs with Apple is a stretch, but putting that aside, the revenue models are completely different: people pay Apple $1000 for an iPhone and then pay even more for services (to the EU’s chagrin). In contrast, entry into the dbt ecosystem is free and users spend money elsewhere (e.g. on a cloud data warehouse or managed data transfer services).

By Tristan’s own admission, and unlike Apple, dbt Labs’ products aren’t even best-in-class:

So we just launched this thing called dbt Explorer. And you look at dbt Explorer and you compare it to a fully featured data catalogue … and I mean, gosh, it’s like David and Goliath, like, our product is kind of a toy. But it does a lot of things, the basic things that you want, and it is automatically in the hands of everybody who is in the dbt ecosystem. And that’s really powerful. And so we want to continue to do that, we’re not going to try to build the most complex best-in-class orchestration tool or best-in-class observer. But when you have a huge user group, and you can say, hey, here’s a really useful thing, and it works with all these other things, then you get to kind of raise the bar on the set of tooling that everyone has access to.

I can’t see how this will lead to a sustainable open source business. Even if they do convert a fraction of their users to customers, these customers are then at risk of churning to a more sophisticated product in the future.

What might happen next? dbt Labs coined “analytics engineering” and if you’re analytics engineer practicing what dbt Labs preach, you probably have a lot of models. What surprises me is that 5% of the dbt install base has over 5000 models! These customers are sitting ducks, locked-in to dbt.

How can dbt Labs get money from those users? By pulling the same move as HashiCorp and switching new releases of dbt Core to the Business Source License (BuSL), which would block competitors offering products based on it.

dbt Labs then release new features to tame huge dbt projects now efficiency and cost optimisation are in vogue and perhaps they will start to see more customers. Meanwhile, they can claim to be committed to open source, despite the BuSL not being an OSI-approved licence.

Coincidentally, HashiCorp’s former chief revenue officer Brandon Sweeney is now dbt Labs’ chief operating officer.

If this does happen, a fork of dbt core is in the interests of all the vendors selling “dbt compatible” products, much like businesses who got together to create the OpenTofu fork of Terraform. I think it would be good for users too. I am surprised by how long dbt has dominated the transformation part of the data stack. More competition and diversity of thought in the data industry is good for everyone.

I think dbt Labs should have stuck with their original consultancy business Fishtown Analytics and made dbt a free software product, charging their clients for a proprietary distribution. But that business model probably wouldn’t go down well with their investors. Unfortunately I see dbt ending up being yet another example of the consequences of choosing the wrong open source business model.

Will dbt adopt a proprietary licence? I think so by Tom Phillips is licensed under CC BY 4.0