Richard Smith 752692e08d Factor out a common base class from function and bound method values. (#2622) 3 lat temu
..
BUILD 9df70fb115 Disable most tracing in the prelude. (#2616) 3 lat temu
README.md 873e78f605 cleanup and docs for tracing output (#1320) 3 lat temu
action.cpp 7222f349d6 Explorer: move deallocation to `StepCleanUp` (#2547) 3 lat temu
action.h 7222f349d6 Explorer: move deallocation to `StepCleanUp` (#2547) 3 lat temu
action_stack.cpp 245e4d1ea6 Explorer: drop unused `PrintScopes` method (#2588) 3 lat temu
action_stack.h 245e4d1ea6 Explorer: drop unused `PrintScopes` method (#2588) 3 lat temu
address.h 563768c6d3 Explorer: split `Member` into dedicated classes (#2421) 3 lat temu
builtins.cpp 4fa71e32f5 Add support for most kinds of declarations to be declared and used as namespace members (#2575) 3 lat temu
builtins.h 2fd7e2b65d Add support for compound assignment and increment (#2526) 3 lat temu
dictionary.h 20728dbd3a CARBON_ header guards (#1261) 4 lat temu
element_path.h 47a1e99b9e Factor `ValueNodeView` out of `StaticScope`. (#2562) 3 lat temu
exec_program.cpp 9df70fb115 Disable most tracing in the prelude. (#2616) 3 lat temu
exec_program.h 9df70fb115 Disable most tracing in the prelude. (#2616) 3 lat temu
heap.cpp 563768c6d3 Explorer: split `Member` into dedicated classes (#2421) 3 lat temu
heap.h cef93fba5e Feature call destructor for tuples and bug fixes for the destructor process (#2255) 3 lat temu
heap_allocation_interface.h cef93fba5e Feature call destructor for tuples and bug fixes for the destructor process (#2255) 3 lat temu
impl_scope.cpp 0e41c569b1 Implement the termination algorithm for impl selection described in #2458 (#2602) 3 lat temu
impl_scope.h 0e41c569b1 Implement the termination algorithm for impl selection described in #2458 (#2602) 3 lat temu
interpreter.cpp 752692e08d Factor out a common base class from function and bound method values. (#2622) 3 lat temu
interpreter.h 9df70fb115 Disable most tracing in the prelude. (#2616) 3 lat temu
matching_impl_set.cpp 81532c8918 Fix mishandling of struct and tuple types in impl matching termination check. (#2609) 3 lat temu
matching_impl_set.h 81532c8918 Fix mishandling of struct and tuple types in impl matching termination check. (#2609) 3 lat temu
pattern_analysis.cpp 6037f9e622 Remove AlternativeValueBase, per review comments on #2605. (#2606) 3 lat temu
pattern_analysis.h 4d522c8e90 Finish making clang-tidy (mostly) work (again) and run -fix (#2312) 3 lat temu
resolve_control_flow.cpp 60af531f5b Support parsing namespace declarations (#2563) 3 lat temu
resolve_control_flow.h 20728dbd3a CARBON_ header guards (#1261) 4 lat temu
resolve_names.cpp 6037f9e622 Remove AlternativeValueBase, per review comments on #2605. (#2606) 3 lat temu
resolve_names.h 20728dbd3a CARBON_ header guards (#1261) 4 lat temu
resolve_unformed.cpp 60af531f5b Support parsing namespace declarations (#2563) 3 lat temu
resolve_unformed.h 47a1e99b9e Factor `ValueNodeView` out of `StaticScope`. (#2562) 3 lat temu
stack.h 20728dbd3a CARBON_ header guards (#1261) 4 lat temu
trace_stream.h 9df70fb115 Disable most tracing in the prelude. (#2616) 3 lat temu
type_checker.cpp 752692e08d Factor out a common base class from function and bound method values. (#2622) 3 lat temu
type_checker.h 9df70fb115 Disable most tracing in the prelude. (#2616) 3 lat temu
value.cpp 6037f9e622 Remove AlternativeValueBase, per review comments on #2605. (#2606) 3 lat temu
value.h 752692e08d Factor out a common base class from function and bound method values. (#2622) 3 lat temu
value_kinds.def 60af531f5b Support parsing namespace declarations (#2563) 3 lat temu
value_transform.h 4e1b585fcf clang-tidy --fix (#2577) 3 lat temu

README.md

explorer execution

The code in this directory defines all phases of program execution after parsing, including typechecking, name resolution, and execution. The overall flow can be found in ExecProgram, which executes each phase in sequence.

The Carbon abstract machine

Execution is specified in terms of an abstract machine, which executes minimal program steps in a loop until the program terminates. The state of the Carbon program (including the stack as well as the heap) is represented explicitly in C++ data structures, rather than implicitly in the C++ call stack. The Interpreter class represents an instance of the abstract machine, and is responsible for maintaining those data structures and implementing the steps of abstract machine execution.

The control-flow state of the abstract machine is encapsulated in an ActionStack object, which a represents a stack of Actions. An Action represents a self-contained computation (such as evaluation of an expression or execution of a statement) as a state machine, and the abstract machine proceeds by repeatedly executing the next state transition of the Action at the top of the stack. Executing a step may modify the internal state of the Action, and may also modify the Action stack, for example by pushing a new Action onto it. When an Action is done executing, it can optionally produce a value as its result, which is made available to the Action below it on the stack.

Carbon values are represented as Value objects, both at compile time and at run time. Note that in Carbon, a type is a kind of value, so types are represented as Values. More subtly, Value can also represent information that isn't a true Carbon value, but needs to be propagated through channels that use Value. Most notably, certain kinds of Value are used to represent the result of "evaluating" a Pattern, which evaluates all the subexpressions nested within it, while preserving the structure of the non-expression parts for use in pattern matching.

Values are always immutable. The abstract machine's mutable memory is represented using the Heap class, which is essentially a mapping of Addresses to Values.

Example

To evaluate the expression ((1 + 2) + 4), the interpreter starts by pushing an Action onto the stack that corresponds to the whole expression:

((1 + 2) + 4) .0. ## ...

In this notation, we're expressing the stack as a sequence of Actions separated by ##, with the top at the left, and representing each Action as the expression it evaluates, followed by its state. An Action consists of:

  • The syntax for the part of the program being executed, in this case ((1 + 2) + 4).
  • An integer pos for position, which is initially 0 and usually counts the number of steps executed. Here that's denoted with a number between two periods.
  • A vector results, which collects the results of any sub-Actions spawned by the Action. Above the results are omitted because they are currently empty.
  • A scope mapping variables to their values, for those variables whose lifetimes are associated with this action.

Then the interpreter proceeds by repeatedly taking the next step of the Action at the top of the stack. For expression Actions, pos typically identifies the operand that the next step should begin evaluation of. In this case, that operand is the expression (1 + 2), so we push a new Action onto the stack, and increment pos on the old one:

(1 + 2) .0. ## ((1 + 2) + 4) .1. ## ...

The next step spawns an action to evaluate 1:

1 .0. ## (1 + 2) .1. ## ((1 + 2) + 4) .1. ## ...

That expression can be fully evaluated in a single step, so the next step evaluates it, appends the result to the next Action down the stack, and pops the now-completed Action off the stack:

(1 + 2) .1. [[1]] ## ((1 + 2) + 4) .1. ## ...

The result 1 has been stored in the results list of the top Action, which is displayed between [[ and ]]. The top Action's pos is 1, so the next step begins evaluation of the second operand:

2 .0. ## (1 + 2) .2. [[1]] ## ((1 + 2) + 4) .1. ## ...

Which again can be evaluated immediately:

(1 + 2) .2. [[1, 2]] ## ((1 + 2) + 4) .1. ## ...

This expression has two operands, so now that pos is 2, all operands have been evaluated, and their results are in the corresponding entries of results. Thus, the next step can compute the expression value, passing it down to the parent Action and popping the completed action as before:

((1 + 2) + 4) .1. [[3]] ## ...

Evaluation now proceeds to the second operand:

4 .0. ## ((1 + 2) + 4) .2. [[3]] ## ...

Which, again, can be evaluated immediately:

((1 + 2) + 4) .2. [[3, 4]] ## ...

pos now indicates that all subexpressions have been evaluated, so the next step computes the final result of 7.