Control Flow Abstraction

Allen Webster

#9529

November 24, 2016

So glad to see Odin up on the site now! A while back I promised you a write up on my number one feature request, and I've had it sitting in my email draft for a while, so I'll go ahead and put it up right now.

An example of the problem

Iterating over doubly linked lists is not hard, but to do it in C, there is some boilerplate to the for loop that can get repeated a lot:

1	for (Node node = list.head; node != 0; node = node->next){ / work on the node */ }

That is a small amount to type, but it's still annoying enough that a lot of programmers ask for a "foreach" construct in new languages. Sometimes it makes sense to make a macro that simplifies the boilerplate. Abstracting the boilerplate becomes particularly important when the loop happens in a lot of places, because if your singly linked list becomes a doubly linked list with a sentinel all the loop controls need to be rewritten:

1	for (Node node = list.sentinel.next; node != &list.sentinel; node = node->next){ / work on the node */ }

Again this is not a lot to type, but if you didn't abstract out how you iterate a linked list, and the for loop boilerplate is everywhere the retyping can be a big hassle and interrupt a train of thought.

A lot of people take this particular example very seriously and spend a lot of time working out a "foreach" system which involves defining an operator/procedure/type or something else that facilitates how to customize "foreach" to everyones own little data structure. Since the only way languages think about custom control flow is through the call stack, these systems always involve something that is ultimately a callback+userdata system. Even then this still only abstracts "foreach" type control flow, and, in my limited experience, is not better than making a macro even in the foreach case.

The idea
Since the user just wants to have the code

1	for (Node node = list.head; node != 0; node = node->next){ / work on the node */ }

written into their code, without using the call stack, in a way that encapsulates iteration rule for the list, I propose a language construct that acts as a typed macro replacement sort of system like the following:

#control_flow forlist($v : ^Node; $list : expr_const) ${B} :: {
    for (v = list.head; v != 0; v = v.next){
        B
    }
}

Then code that loops over a list could be written as:

1	forlist (node : ^Node; list) { /* work on the node */ }

Here is a break down of the ideas in this example:

1	#control_flow

marks this as the declaration of a new control flow structure.

forlist

is the name of the new control flow structure.

1	$v : ^Node;

specifies that it expects a variable declaration or the name of an existing variable on the first parameter.

1	$list : expr_const

specifies that it expects an expression with no side effects for the second parameter.

${B}

captures the first statement after the control flow parameter list.

The rest defines how to configure those pieces in terms of the already defined control flow structures.

A standard way to handle this would be to write a function that takes a callback and user data, and that does the loop code. That is quite similar to this, but it takes a lot more for the user to think in those terms, and I suspect the compiler probably struggles to optimize it as well as it could have optimized this.

Here is an example where control flow abstraction would allow an API writer to trust that the user will use the code correctly, when the API writer could not have made that assumption without control flow abstraction.

When providing a system for iterating through data in a chunk by chunk fashion, there are some things the author can do in the API to help the user keep their indexes straight. The bad way to do this is just report to the user the start and end positions of the chunk they are currently on, and force them to always subtract an offset from the indexes. A nice way to do it, is to provide an array pointer that is rebased so that even if the chunk ranges from indexes 900 to 1000, the user can just plug in index 950 to the array, and it has been shifted so that the absolute index lines up. The only issue left for the author is to help the user get the control flow right, and the best I have come up with in C is to transform all for loops via the following rule:

The Original:

1	for (A; i < end; ++i){ /* work */ }

The Transformed Version:

if (init_stream(^stream, i, end)){
    still_looping := 1;
    A;
    do{
        for (; i < stream.end; ++i){ /* work */ }
        still_looping = stream_move_forward(^stream);
    }while(still_looping);
}

This is not a trivial rewrite. The end condition has been moved to a completely different place, the pre-statement A has been relocated, there is an extra variable to declare, the whole thing is now two levels of nesting deeper than it should be, and not all of the changes are on the same side of the main work block, meaning you have to go back and forth across potentially lots of code to do this tranformation by hand.

My use for this pattern is all over my code so all this boilerplate has to be repeated because there is no way to abstract over control flow. If I try to ask end users to use this system, there will likely be many many bugs, so I am still telling everyone they should just read all of the data and handle it that way, even though the chunk system would solve their problem more efficiently, and almost just as easily if I could trust that the control flow would not become a problem.

Edited by Allen Webster on November 24, 2016, 9:11pm

ratchetfreak

#9530

November 24, 2016

"struggle to optimize" with the callback+userdata case is perhaps a bit strong.

If the hypothetical list_for_each call (which takes the function pointer and userdata gets inlined then it can go ahead and devirtualize the callback (basically constant fold function pointer into a static call) and then inline that function call.

Of course this all hinges on the compiler inlining that first function call. But a simple loop should be a good candidate to be inlined. And with the recent oop+small functions fad, compilers (at least llvm from the talks about it I've seen) really like to inline everything they can.

Allen Webster

#9531

November 24, 2016

Okay yes, I concede that optimization should be possible assuming all of the source code is provided to the compiler at once and it can see what function it will have to inline. I still don't like the semantic setup there.

Ginger Bill

#9532

November 24, 2016

Thank you for this detailed post, Allen.

Metaprogramming is what I want to tackle next and there are few types:

* Introspect and Reflection (Already implemented)
* Compile Time Execution
* Parametric Polymorphism ("Generics")
* Macros
* Templates

The latter is what you may be suggesting. I've been thinking about the very same problem and I think I may have a similar solution.
n.b. Syntax below is definitely not final but enough to get the gist

// Allen's Linked list control flow
for_list :: template(node: expr, list: ^Node) -> (body: stmt) {
	for node := list.head; node != nil; node = node.next {
		body
	}
}

for_list!(v, list) {
	// Whatever and Amen
}
// Expands to
for v := list.head; v != nil; v != v.next {
	// Whatever and Amen
}

// Wrapper code
DEBUG :: true
log :: template(msg: string, args: ..any) {
	if DEBUG {
		fmt.println(msg, ..args)
	}
}

log!("Hellope", "World")
// Expands to (if DEBUG == true)
fmt.println(msg, ..args)

// Pascal/Python style with
with_file :: template(f: expr, filename: string, mode: os.FileMode) -> (body: stmt) {
	fn := filename
	f, err := os.open(fn, mode)
	if err == os.Error.NONE {
		defer os.close(^f)

		body

	} else {
		fmt.println("Cannot open file: ", fn)
	}
}

with_file!(txt, "some_doc.txt", os.FileMode.WRITE) {
	fmt.fprintln(^txt, "Line 1")
	fmt.fprintln(^txt, "Line 2")
}

With this style of templates, it's very easy to generate any code you need whilst being safe (compared to C macros), and the code is still the same language (unlike C++ templates).

I don't know if this is the best solution for this kind metaprogramming. I want this language to remain small and simple so I'm not sure if I should implement everything into the language when possible.

Edited by Ginger Bill on November 24, 2016, 11:10pm

ratchetfreak

#9545

November 25, 2016

Mr4thDimention
Okay yes, I concede that optimization should be possible assuming all of the source code is provided to the compiler at once and it can see what function it will have to inline. I still don't like the semantic setup there.

I've seen that in D as part of the operator overloading called. It transforms the foreach body into a delegate (function+context point) and passes it into a member function opApply of the collection you want to iterate over.

To help differentiate break and continue the delegate returns an int that is 0 means continue iterating and if non 0 means break out of the loop and return that value (so labeled breaks can be differentiated). It's the first time I've seen internal looping like that as a language construct even if the semantics aren't the greatest.

One of the biggest downsides of it though is that the delegate's context is allocated on the heap and cleaned up by GC. Unless the opApply's delegate parameter is declared with "scope" or things get optimized as I described earlier.