Designing a URL-based query syntax for GraphQL

June 15, 2021

admin

Currently, if we want to use HTTP caching in GraphQL, we must use a GraphQL server that supports persisted queries. That’s because the persisted query will already have the GraphQL query stored in the server; as such, we do not need to provide this information in our request.

In order for GraphQL servers to also support HTTP caching via the single endpoint, the GraphQL query must be provided as a URL param. The GraphQL over HTTP specification will hopefully achieve this goal, providing a standardized language for all GraphQL clients, servers, and libraries to interact with each other.

I suspect, though, that all attempts to pass a GraphQL query via a URL param will be far from ideal. This is because a URL param must be provided as a single-line value, so the query will either need to be encoded or reformatted, making it difficult to understand (for us humans, not for machines).

For instance, this is how a GraphQL query looks when replacing all line breaks with spaces to make it fit within a single line:

{ posts(limit:5) { id title @titleCase excerpt @default(value:”No title”, condition:IS_EMPTY) author { name } tags { id name } comments(limit:3, order:”date|DESC”) { id date(format:”d/m/Y”) author { name } content } } }

Can you make sense of it? Me neither.

And this is how the GraphiQL client encodes the simple query { posts { id title } } as a URL param:

%7B%0A%20%20posts%20%7B%0A%20%20%20%20id%0A%20%20%20%20title%0A%20%20%7D%0A%7D

Once again, we don’t know what’s going on here.

Both these examples evince the issue: single-line GraphQL queries can work from a technical point of view, transmitting the information to the server, but it is not easy for people to read and write those queries.

Being able to operate with single-line queries would have many benefits. For instance, we could compose the query directly in the browser’s address bar, doing away with the need for some GraphQL client.

It is not that I dislike GraphQL clients — indeed, I love GraphiQL. But I do dislike the idea that I depend on them.

In other words, we could benefit from a query syntax that allows people to:

Write a query directly in a single line
Understand the single-line query at a glance

This is a formidable challenge. But it is not insurmountable.

In this article, I will introduce an alternative syntax, which supports being “easy to read and write in a single line” by us humans.

I am not really proposing introducing this syntax to GraphQL — I understand that would never happen. But the design process for this syntax can, nevertheless, exemplify what we must pay attention to when designing the GraphQL over HTTP specification.

Why is the GraphQL syntax so difficult to understand in a single line?

Let’s first explore what the issue is with the GraphQL syntax and then generalize it to other syntaxes.

Identifying the problem

As I see it, the difficulty comes from fields in a GraphQL query being nested, wherein the nesting can advance and retreat throughout the query. It is this coming-and-going behavior that makes it hard to grasp when written in a single line.

If the nesting in the query only advances, then it’s not so difficult to understand it. Take this query, for instance:

{
posts {
id
title
excerpt
comments {
id
date
content
author {
id
name
url
posts {
id
title
}
}
}
}
}

Here, the nesting only goes forward:

GraphQL query, advancing only.

When looking over the always-going-forward query, and scanning it from left to right, we can still understand to what entity every field belongs:

{ posts { id title excerpt comments { id date content author { id name url posts { id title } } } } }

Now, consider the same GraphQL query, but rearranging the fields so that leaves appear after connections:

{
posts {
id
comments {
id
date
author {
posts {
id
title
}
id
name
url
}
content
}
title
excerpt
}
}

In this case, we can say that fields advance and also retreat:

GraphQL query, advancing and retreating.

This query can be written in a single line, like this:

{ posts { id comments { id date author { posts { id title } id name url } content } title excerpt } }

Now, understanding the query is not so easy anymore. After a retreating level (i.e., right after a connection), we might not remember which entity came before it, so we won’t grasp where the field belongs:

To what entity do these fields belong to?

(I guess this is related to the human brain having a limited short-term memory, able to hold not more than a few items at a time.)

And when there are many levels of going forward and back, then it becomes quite impossible to fully grasp. This query is understandable:

{
posts {
id
comments {
id
date
children {
id
author {
name
url
}
content
}
author {
posts {
id
title
tags {
name
}
}
id
name
friends {
id
name
}
url
}
content
}
title
excerpt
}
author {
name
}
}

But there’s no way we can make sense of its single-line equivalent:

{ posts { id comments { id date children { id author { name url } content } author { posts { id title tags { name } } id name friends { id name } url } content } title excerpt } author { name } }

In conclusion, GraphQL queries cannot be easily represented in a single-line, in such a way that we humans can make sense of it, because of its nesting behavior.

Generalizing the problem

The issue is not specific to GraphQL. Indeed, it will happen for a syntax — any syntax — where the elements advance and retreat.

Take JSON, for instance:

{
“name”: “leoloso/PoP”,
“description”: “PoP monorepo”,
“repositories”: [
{
“type”: “package”,
“package”: {
“name”: “leoloso-pop-api-wp/newsletter-subscriptions-rest-endpoints”,
“version”: “master”,
“type”: “wordpress-plugin”,
“source”: {
“url”: “https://gist.github.com/leoloso/6588f6c1bdcce82fc317052616d3dfb4”,
“type”: “git”,
“reference”: “master”
}
}
},
{
“type”: “package”,
“package”: {
“name”: “leoloso-pop-api-wp/disable-user-edit-profile”,
“version”: “0.1.1”,
“type”: “wordpress-plugin”,
“source”: {
“url”: “https://gist.github.com/leoloso/4e367eb8d8014a7aa7580567608bd5b4”,
“type”: “git”,
“reference”: “master”
}
}
},
{
“type”: “vcs”,
“url”: “https://github.com/leoloso/wp-muplugin-loader.git”
}
],
“minimum-stability”: “dev”,
“prefer-stable”: true,
“require”: {
“php”: “~8.0”,
“getpop/api-rest”: “dev-master”,
“getpop/engine-wp-bootloader”: “dev-master”
},
“extra”: {
“branch-alias”: {
“dev-master”: “1.0-dev”
},
“installer-types”: [
“graphiql-client”,
“graphql-voyager”
],
“installer-paths”: {
“wordpress/wp-content/mu-plugins/{$name}/”: [
“type:wordpress-muplugin”
],
“wordpress/wp-content/plugins/{$name}/”: [
“type:wordpress-plugin”,
“getpop/engine-wp-bootloader”
]
}
},
“config”: {
“sort-packages”: true
}
}

Converting it to a single line makes it really difficult to comprehend:

{ “name”: “leoloso/PoP”, “description”: “PoP monorepo”, “repositories”: [ { “type”: “package”, “package”: { “name”: “leoloso-pop-api-wp/newsletter-subscriptions-rest-endpoints”, “version”: “master”, “type”: “wordpress-plugin”, “source”: { “url”: “https://gist.github.com/leoloso/6588f6c1bdcce82fc317052616d3dfb4”, “type”: “git”, “reference”: “master” } } }, { “type”: “package”, “package”: { “name”: “leoloso-pop-api-wp/disable-user-edit-profile”, “version”: “0.1.1”, “type”: “wordpress-plugin”, “source”: { “url”: “https://gist.github.com/leoloso/4e367eb8d8014a7aa7580567608bd5b4”, “type”: “git”, “reference”: “master” } } }, { “type”: “vcs”, “url”: “https://github.com/leoloso/wp-muplugin-loader.git” } ], “minimum-stability”: “dev”, “prefer-stable”: true, “require”: { “php”: “~8.0”, “getpop/api-rest”: “dev-master”, “getpop/engine-wp-bootloader”: “dev-master” }, “extra”: { “branch-alias”: { “dev-master”: “1.0-dev” }, “installer-types”: [ “graphiql-client”, “graphql-voyager” ], “installer-paths”: { “wordpress/wp-content/mu-plugins/{$name}/”: [ “type:wordpress-muplugin” ], “wordpress/wp-content/plugins/{$name}/”: [ “type:wordpress-plugin”, “getpop/engine-wp-bootloader” ] } }, “config”: { “sort-packages”: true } }

What’s more, when the syntax uses spacing to nest its elements, it won’t be even possible to write it in a single line.

That’s the case, for instance, with YAML:

services:
_defaults:
public: true
autowire: true
autoconfigure: true

PoPAPIPersistedQueriesPersistedQueryManagerInterface:
class: PoPAPIPersistedQueriesPersistedQueryManager

# Override the service
PoPComponentModelSchemaFieldQueryInterpreterInterface:
class: PoPAPISchemaFieldQueryInterpreter

PoPAPIHooks:
resource: ‘../src/Hooks/*’

Designing a different query syntax

I will describe the design for an alternative to the GraphQL syntax: the PQL syntax, used by GraphQL by PoP (the GraphQL server in PHP that I’ve authored) to accept URL-based queries passed via GET.

Since the problem with the GraphQL syntax arises from retreating nested fields, the solution seems evident: the flow of the query must be always-forward.

How does PQL achieve this? In order to demonstrate, let’s explore the PQL syntax.

Field syntax

In GraphQL, a field is written like this:

{
alias:fieldName(fieldArgs)@fieldDirective(directiveArgs)
}

In PQL, a field is written like this:

fieldName(fieldArgs)[@alias]<fieldDirective(directiveArgs)>

So it’s quite similar, but there are a few differences:

The alias is placed not before the field, but after the field
The alias is identified not with :, but with @ (and, optionally, surrounded by […] for “bookmarks,” explained later on)
The directive is identified not with @, but surrounded with <…>

These differences are directly related to the always-forward flow required for the query.

In my own experience, when writing queries directly in the browser’s address bar, I always think of the need for the alias after having written the field name, not before. Therefore, using the order as in GraphQL, I had to backtrack to that position (pressing the left arrow key), add the alias, and go back to the final position (pressing the right arrow key).

That was quite cumbersome. It made much more sense to place the alias after the field name, making it a natural flow.

When defining the alias after the field name, it doesn’t make sense anymore to use :. This symbol is used by GraphQL to have the JSON response respect the shape of the query. Once the order between field and alias is inverted, using @ seems a natural fit.

This, in turn, meant we couldn’t use @ to identify directives anymore. Instead, I chose a surrounding syntax <…> (e.g., <directiveName>) so that directives can also be nested (e.g., <directive1<directive2>>), making it possible for GraphQL by PoP to support the composable directives feature.

Fields

In GraphQL, we can add two or more fields by adding a space or line break between them:

{
foo
bar
}

In PQL, we use the character | to separate fields:

foo|bar

We can already visualize how the query is composed as a single line:

There are no {} chars
There are no spaces or line breaks

We can also appreciate that the query can be composed directly in the browser, passed via URL param query.

For instance, the URL to execute query id|__typename is: ${endpoint}?query=id|__typename.

Using DevTools, we can see how HTTP caching is supported for the GraphQL single endpoint:

HTTP caching for the GraphQL single endpoint.

For all queries demonstrated below, there will be a link Execute query in browser. Click on them to visualize how PQL works on an actual site in production.

Making queries visually appealing

Similar to GraphQL, newlines (and also spaces) add no semantic meaning. Thus, we can conveniently add line breaks to help visualize the query:

foo|
bar

When using Firefox, this query can be copied (from a text editor, a webpage, etc.) and pasted into the browser’s address bar, and all line breaks will be automatically removed, creating the equivalent single-line query.

Connections

GraphQL uses characters {} to define data for connections:

{
posts {
author {
id
}
}
}

In PQL, the query only advances, never retreats. So there is an equivalent for {, which is ., but there is no equivalent for } since it will not be needed.

posts.
author.
id

Why is the GraphQL syntax so difficult to understand in a single line?

Identifying the problem

Generalizing the problem

Designing a different query syntax

Field syntax

Fields

Making queries visually appealing

Connections

Syntax for an advancing-only flow

Bookmarks to remove verbosity

Simplifying field arguments

Variables

Fragments

Converting queries between GraphQL and PQL syntaxes

Converting the introspection query

Some more examples

Conclusion

Leave a Reply Cancel reply