|
|
vor 4 Jahren | |
|---|---|---|
| .. | ||
| code_and_name_organization | vor 5 Jahren | |
| control_flow | vor 4 Jahren | |
| generics | vor 4 Jahren | |
| interoperability | vor 5 Jahren | |
| lexical_conventions | vor 5 Jahren | |
| README.md | vor 4 Jahren | |
| aliases.md | vor 5 Jahren | |
| blocks_and_statements.md | vor 5 Jahren | |
| functions.md | vor 4 Jahren | |
| metaprogramming.md | vor 5 Jahren | |
| name_lookup.md | vor 4 Jahren | |
| naming_conventions.md | vor 5 Jahren | |
| operators.md | vor 5 Jahren | |
| pattern_matching.md | vor 5 Jahren | |
| primitive_types.md | vor 4 Jahren | |
| structs.md | vor 5 Jahren | |
| templates.md | vor 5 Jahren | |
| tuples.md | vor 5 Jahren | |
| variables.md | vor 4 Jahren | |
This documentation describes the design of the Carbon language, and the rationale for that design. The goal is to provide sufficient coverage of the design to support the following audiences:
For Carbon developers, documentation that is more suitable for learning the language will be made available separately.
Eventually, this document hopes to provide a high-level overview of the design of the Carbon language. It should summarize the key points across the different aspects of the language design and link to more detailed and comprehensive design documents to expand on specific aspects of the design. That means it isn't and doesn't intend to be complete or stand on its own. Notably, it doesn't attempt to provide detailed and comprehensive justification for design decisions. Those should instead be provided by the dedicated and focused designs linked to from here. However, it should provide an overarching view of the design and a good basis for diving into specific details.
However, these are extremely early days for Carbon. Currently, this document tries to capture two things:
The utility of capturing these at this early stage is primarily to give everyone a reasonably consistent set of terminology and context as we begin fleshing out concrete (and well justified) designs for each part of the language. In some cases, it captures ideas that may be interesting to explore, but isn't meant to overly anchor on them. Any ideas here need to be fully explored and justified with a detailed analysis. The context of #1 (directly evolving C++, experience building Clang, and experience working on C++ codebases including Clang and LLVM themselves) is also important. It is both an important signal but also a bias.
In order to keep example code consistent, we are making choices that may change
later. In particular, where $ is shown in examples, it is a placeholder: $
is a well-known bad symbol due to international keyboard layouts, and will be
cleaned up during evolution.
References: Source files and lexical conventions
TODO: References need to be evolved.
// .... However, they are required to be the only
non-whitespace on the line for readability.Block comments look like //\{ ... //\}, with each marker on its own line.
Nested block comments are supported using named regions. For example:
live code
//\{
commented code
//\{ nested block
commented code in nested block
//\} nested block
//\}
live code
Decimal, hexadecimal, and binary integer literals and decimal and
hexadecimal floating-point literals are supported, with _ as a digit
separator. For example, 42, 0b1011_1101 and 0x1.EEFp+5. Numeric
literals are case-sensitive: 0x, 0b, e+, and p+ must be lowercase,
whereas hexadecimal digits must be uppercase. A digit is required on both
sides of a period.
References: Code and name organization
Name paths in Carbon always start with the package name. Additional namespaces may be specified as desired.
For example, this code declares a struct Geometry.Shapes.Flat.Circle in a
library Geometry/OneSide:
package Geometry library("OneSide") namespace Shapes;
namespace Flat;
struct Flat.Circle { ... }
This type can be used from another package:
package ExampleUser;
import Geometry library("OneSide");
fn Foo(Geometry.Shapes.Flat.Circle circle) { ... }
References: Lexical conventions
TODO: References need to be evolved.
Various constructs introduce a named entity in Carbon. These can be functions, types, variables, or other kinds of entities that we'll cover. A name in Carbon is formed from a word, which is a sequence of letters, numbers, and underscores, and which starts with a letter. We intend to follow Unicode's Annex 31 in selecting valid identifier characters, but a concrete set of valid characters has not been selected yet.
References: Naming conventions
TODO: References need to be evolved.
Our current proposed naming convention are:
UpperCamelCase for names of compile-time resolved constants, whether they
participate in the type system or not.lower_snake_case for keywords and names of run-time resolved values.As a matter of style and consistency, we will follow these conventions where possible and encourage convergence.
For example:
N.N, since it will be a constant available to the
compiler at code generation time.UpperCamelCase.lower_snake_case.import uses lower_snake_case.References: Aliases
TODO: References need to be evolved.
Carbon provides a facility to declare a new name as an alias for a value. This is a fully general facility because everything is a value in Carbon, including types.
For example:
alias MyInt = Int;
This creates an alias called MyInt for whatever Int resolves to. Code
textually after this can refer to MyInt, and it will transparently refer to
Int.
References: name lookup
TODO: References need to be evolved.
Unqualified name lookup will always find a file-local result, including aliases.
References: Name lookup
TODO: References need to be evolved.
Common types that we expect to be used universally will be provided for every
file, including Int and Bool. These will likely be defined in a special
"prelude" package.
References: Lexical conventions and operators
TODO: References need to be evolved.
Expressions describe some computed value. The simplest example would be a
literal number like 42: an expression that computes the integer value 42.
Some common expressions in Carbon include:
42, 3.1419, "Hello World!"Operators:
++i, --j
-x1 + 2, 3 - 4, 2 * 5, 6 / 32 & 3, 2 | 4, 3 ^ 1, ~71 << 3, 8 >> 12 == 2, 3 != 4, 5 < 6, 7 > 6, 8 <= 8, 8 >= 8a and b, c or dParenthesized expressions: (7 + 8) * (3 - 1)
References: Functions
TODO: References need to be evolved.
Functions are the core unit of behavior. For example:
fn Sum(a: Int, b: Int) -> Int;
Breaking this apart:
fn is the keyword used to indicate a function.Sum.Int parameters, a and b.Int result.You would call this function like Sum(1, 2).
References: Blocks and statements
TODO: References need to be evolved.
The body or definition of a function is provided by a block of code containing statements. The body of a function is also a new, nested scope inside the function's scope, meaning that parameter names are available.
Statements within a block are terminated by a semicolon. Each statement can, among other things, be an expression.
For example, here is a function definition using a block of statements, one of which is nested:
fn Foo() {
Bar();
{
Baz();
}
}
References: Variables
Blocks introduce nested scopes and can contain local variable declarations that work similarly to function parameters.
For example:
fn Foo() {
var x: Int = 42;
}
Breaking this apart:
var is the keyword used to indicate a variable.x.Int.42.References: TODO
TODO: References need to be evolved.
References: Control flow
Blocks of statements are generally executed sequentially. However, statements are the primary place where this flow of execution can be controlled.
if and elseReferences: Control flow
if and else provide conditional execution of statements. For example:
if (fruit.IsYellow()) {
Print("Banana!");
} else if (fruit.IsOrange()) {
Print("Orange!");
} else {
Print("Vegetable!");
}
This code will:
Banana! if fruit.IsYellow() is True.Orange! if fruit.IsYellow() is False and fruit.IsOrange() is
True.Vegetable! if both of the above return False.whileReferences: Control flow
while statements loop for as long as the passed expression returns True. For
example, this prints 0, 1, 2, then Done!:
var x: Int = 0;
while (x < 3) {
Print(x);
++x;
}
Print("Done!");
forReferences: Control flow
for statements support range-based looping, typically over containers. For
example, this prints all names in names:
for (var name: String in names) {
Print(name);
}
PrintNames() prints each String in the names List in iteration order.
breakReferences: Control flow
The break statement immediately ends a while or for loop. Execution will
resume at the end of the loop's scope. For example, this processes steps until a
manual step is hit (if no manual step is hit, all steps are processed):
for (var step: Step in steps) {
if (step.IsManual()) {
Print("Reached manual step!");
break;
}
step.Process();
}
continueReferences: Control flow
The continue statement immediately goes to the next loop of a while or
for. In a while, execution continues with the while expression. For
example, this prints all non-empty lines of a file, using continue to skip
empty lines:
var f: File = OpenFile(path);
while (!f.EOF()) {
var line: String = f.ReadLine();
if (line.IsEmpty()) {
continue;
}
Print(line);
}
returnReferences: Control flow
The return statement ends the flow of execution within a function, returning
execution to the caller. If the function returns a value to the caller, that
value is provided by an expression in the return statement. For example:
fn Sum(a: Int, b: Int) -> Int {
return a + b;
}
References: Primitive types, tuples, and structs
TODO: References need to be evolved.
Carbon's core types are broken down into three categories:
The first two are intrinsic and directly built in the language. The last aspect of types allows for defining new types.
Expressions compute values in Carbon, and these values are always strongly typed much like in C++. However, an important difference from C++ is that types are themselves modeled as values; specifically, compile-time constant values. However, in simple cases this doesn't make much difference.
References: Primitive types
TODO: References need to be evolved.
These types are fundamental to the language as they aren't either formed from or modifying other types. They also have semantics that are defined from first principles rather than in terms of other operations. These will be made available through the prelude package.
Primitive types fall into the following categories:
Bool - a boolean type with two possible values: True and False.Int and UInt - signed and unsigned 64-bit integer types.
Int8, Int16, Int32, Int128, and Int256.Float64 - a floating point type with semantics based on IEEE-754.
Float16, Float32, and
Float128.BFloat16 is also provided.String - a byte sequence treated as containing UTF-8 encoded text.
StringView - a read-only reference to a byte sequence treated as
containing UTF-8 encoded text.References: Tuples
TODO: References need to be evolved.
The primary composite type involves simple aggregation of other types as a tuple. In formal type theory, tuples are product types.
An example use of tuples is:
fn DoubleBoth(x: Int, y: Int) -> (Int, Int) {
return (2 * x, 2 * y);
}
Breaking this example apart:
Int types.Int values.Both of these are expressions using the tuple syntax
(<expression>, <expression>). The only difference is the type of the tuple
expression: one is a tuple of types, the other a tuple of values.
Element access uses subscript syntax:
fn DoubleTuple(x: (Int, Int)) -> (Int, Int) {
return (2 * x[0], 2 * x[1]);
}
Tuples also support multiple indices and slicing to restructure tuple elements:
// This reverses the tuple using multiple indices.
fn Reverse(x: (Int, Int, Int)) -> (Int, Int, Int) {
return x[2, 1, 0];
}
// This slices the tuple by extracting elements [0, 2).
fn RemoveLast(x: (Int, Int, Int)) -> (Int, Int) {
return x[0 .. 2];
}
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
References: Structs
TODO: References need to be evolved.
structs are a way for users to define their own data strutures or named
product types.
For example:
struct Widget {
var x: Int;
var y: Int;
var z: Int;
var payload: String;
}
Breaking apart Widget:
Widget has three Int members: x, y, and z.Widget has one String member: payload.dial, a member can be referenced with dial.paylod.More advanced structs may be created:
struct AdvancedWidget {
// Do a thing!
fn DoSomething(self: AdvancedWidget, x: Int, y: Int);
// A nested type.
struct Nestedtype {
// ...
}
private var x: Int;
private var y: Int;
}
fn Foo(thing: AdvancedWidget) {
thing.DoSomething(1, 2);
}
Breaking apart AdvancedWidget:
AdvancedWidget has a public object method DoSomething.
DoSomething explicitly indicates how the AdvancedWidget is passed to
it, and there is no automatic scoping - self must be specified as the
first input. The self name is also a keyword that explains how to
invoke this method on an object.DoSomething accepts AdvancedWidget by value, which is easily
expressed here along with other constraints on the object parameter.AdvancedWidget has two private data members: x and y.
AdvancedWidget only, providing a layer of easy validation of the most
basic interface constraints.Nestedtype is a nested type, and can be accessed as
AdvancedWidget.Nestedtype.TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a detailed design and a high level summary provided inline.
References: Pattern matching
TODO: References need to be evolved.
The most prominent mechanism to manipulate and work with types in Carbon is pattern matching. This may seem like a deviation from C++, but in fact this is largely about building a clear, coherent model for a fundamental part of C++: overload resolution.
match control flowReferences: Pattern matching
TODO: References need to be evolved.
match is a control flow similar to switch of C/C++ and mirrors similar
constructs in other languages, such as Swift.
An example match is:
fn Bar() -> (Int, (Float, Float));
fn Foo() -> Float {
match (Bar()...) {
case (42, (x: Float, y: Float)) => {
return x - y;
}
case (p: Int, (x: Float, _: Float)) if (p < 13) => {
return p * x;
}
case (p: Int, _: auto) if (p > 3) => {
return p * Pi;
}
default => {
return Pi;
}
}
}
Breaking apart this match:
Bar().
case that matches this value, and
execute that block.case pattern contains a value pattern, such as (Int p, auto _),
followed by an optional boolean predicate introduced by the if keyword.
case pattern to match.auto for a type will always match.Value patterns may be composed of the following:
42, whose value must be equal to match.: and followed by a type,
such as Int.
_ may be used to discard the value once
matched.(x: Float, y: Float), which match against tuples and tuple-like values by
recursively matching on their elements.References: Pattern matching
TODO: References need to be evolved.
Value patterns may be used when declaring local variables to conveniently destructure them and do other type manipulations. However, the patterns must match at compile time, so a boolean predicate cannot be used directly.
An example use is:
fn Bar() -> (Int, (Float, Float));
fn Foo() -> Int {
var (p: Int, _: auto) = Bar();
return p;
}
To break this apart:
Int returned by Bar() matches and is bound to p, then returned.(Float, Float) returned by Bar() matches and is discarded by
_: auto.References: Pattern matching
TODO: References need to be evolved. Needs a detailed design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
References: Templates
TODO: References need to be evolved.
Carbon templates follow the same fundamental paradigm as C++ templates: they are instantiated when called, resulting in late type checking, duck typing, and lazy binding. Although generics are generally preferred, templates enable translation of code between C++ and Carbon, and address some cases where the type checking rigor of generics are problematic.
References: Templates
TODO: References need to be evolved.
User-defined types may have template parameters. The resulting type-function may be used to instantiate the parameterized definition with the provided arguments in order to produce a complete type. For example:
struct Stack(T:$$ Type) {
var storage: Array(T);
fn Push(value: T);
fn Pop() -> T;
}
Breaking apart the template use in Stack:
Stack is a paremeterized type accepting a type T.T may be used within the definition of Stack anywhere a normal type
would be used, and will only be type checked on instantiation.var ... Array(T) instantiates a parameterized type Array when Stack is
instantiated.References: Templates
TODO: References need to be evolved.
Both implicit and explicit function parameters in Carbon can be marked as template parameters. When called, the arguments to these parameters trigger instantiation of the function definition, fully type checking and resolving that definition after substituting in the provided (or computed if implicit) arguments. The runtime call then passes the remaining arguments to the resulting complete definition.
fn Convert[T:$$ Type](source: T, U:$$ Type) -> U {
var converted: U = source;
return converted;
}
fn Foo(i: Int) -> Float {
// Instantiates with the `T` implicit argument set to `Int` and the `U`
// explicit argument set to `Float`, then calls with the runtime value `i`.
return Convert(i, Float);
}
Here we deduce one type parameter and explicitly pass another. It is not possible to explicitly pass a deduced type parameter; instead the call site should cast or convert the argument to control the deduction. In this particular example, the explicit type is passed after a runtime parameter. While this makes that type unavailable to the declaration of that runtime parameter, it still is a template parameter and available to use as a type in the remaining parts of the function declaration.
References: Templates
TODO: References need to be evolved.
An important feature of templates in C++ is the ability to customize how they end up specialized for specific arguments. Because template parameters (whether as type parameters or function parameters) are pattern matched, we expect to leverage pattern matching techniques to provide "better match" definitions that are selected analogously to specializations in C++ templates. When expressed through pattern matching, this may enable things beyond just template parameter specialization, but that is an area that we want to explore cautiously.
TODO: lots more work to flesh this out needs to be done...
References: Metaprogramming
TODO: References need to be evolved. Needs a detailed design and a high level summary provided inline.
Carbon provides metaprogramming facilities that look similar to regular Carbon code. These are structured, and do not offer arbitrary inclusion or preprocessing of source text such as C/C++ does.
Carbon provides some higher-order abstractions of program execution, as well as the critical underpinnings of such abstractions.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
TODO: Needs a feature design and a high level summary provided inline.
References: Bidirectional interoperability with C/C++
TODO: References need to be evolved. Needs a detailed design and a high level summary provided inline.