4 jaren geleden · b9619db402
--- a/proposals/p0144.md
+++ b/proposals/p0144.md
@@ -0,0 +1,318 @@
 
															+# Numeric literal semantics
														
 
															+
														
 
															+<!--
														
 
															+Part of the Carbon Language project, under the Apache License v2.0 with LLVM
														
 
															+Exceptions. See /LICENSE for license information.
														
 
															+SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
														
 
															+-->
														
 
															+
														
 
															+[Pull request](https://github.com/carbon-language/carbon-lang/pull/144)
														
 
															+
														
 
															+## Table of contents
														
 
															+
														
 
															+<!-- toc -->
														
 
															+
														
 
															+## Table of contents
														
 
															+
														
 
															+-   [Problem](#problem)
														
 
															+-   [Background](#background)
														
 
															+-   [Proposal](#proposal)
														
 
															+-   [Details](#details)
														
 
															+    -   [Prelude support](#prelude-support)
														
 
															+    -   [Implicit conversions](#implicit-conversions)
														
 
															+    -   [Examples](#examples)
														
 
															+-   [Alternatives considered](#alternatives-considered)
														
 
															+    -   [Use an ordinary integer or floating-point type for literals](#use-an-ordinary-integer-or-floating-point-type-for-literals)
														
 
															+    -   [Use same type for all literals](#use-same-type-for-all-literals)
														
 
															+    -   [Allow leading `-` in literal tokens](#allow-leading---in-literal-tokens)
														
 
															+
														
 
															+<!-- tocstop -->
														
 
															+
														
 
															+## Problem
														
 
															+
														
 
															+When a numeric literal appears in a program, we need to understand its
														
 
															+semantics:
														
 
															+
														
 
															+-   What type does it have?
														
 
															+-   What value is produced by operations on it?
														
 
															+-   When can it validly be used to initialize an object?
														
 
															+
														
 
															+## Background
														
 
															+
														
 
															+In C++, numeric literals have either an integral type or a floating-point type.
														
 
															+C++ provides permission for implementations to add extended integral types, but
														
 
															+in practice (for bad reasons relating to `intmax_t`) implementations do not do
														
 
															+so, so there are a small finite set of types that any given numeric literal
														
 
															+might have:
														
 
															+
														
 
															+-   `int`, `long`, `long long`, or `unsigned` versions of these
														
 
															+-   `float`, `double`, or `long double`
														
 
															+
														
 
															+The choice of type is determined solely by the literal.
														
 
															+
														
 
															+The C++ approach is error-prone and problematic:
														
 
															+
														
 
															+-   Lossy conversions from literals in initializers are permitted.
														
 
															+-   Lossy operations on literals are permitted; for example, on a typical
														
 
															+    implementation, `1 << 60` has value `0` because `1` is a 32-bit type.
														
 
															+-   Attempting to naturally express some values has undefined behavior; for
														
 
															+    example, `int x = -2147483648;` typically results in undefined behavior even
														
 
															+    when -2147483648 is a valid `int` value.
														
 
															+-   Integer literals with value 0 have special semantics that are lost when the
														
 
															+    integer is passed to a function: "perfect" forwarding doesn't work for such
														
 
															+    literals.
														
 
															+-   The built-in types are privileged: only the types listed above have
														
 
															+    literals. There is no syntax for a 64-bit integer literal, only for (for
														
 
															+    example) a `long int` literal, which may or may not 64 bits wide.
														
 
															+-   The type of a literal can be unpredictable in portable code, as it can
														
 
															+    depend on which type a particular value happens to fit into.
														
 
															+
														
 
															+## Proposal
														
 
															+
														
 
															+Numeric literals have a type derived from their value, and can be converted to
														
 
															+any type that can represent that value.
														
 
															+
														
 
															+Simple operations such as arithmetic that involve only literals also produce
														
 
															+values of literal types.
														
 
															+
														
 
															+## Details
														
 
															+
														
 
															+Numeric literals have a type derived from their value. Two integer literals have
														
 
															+the same type if and only if they represent the same integer. Two real number
														
 
															+literals have the same type if and only if they represent the same real number.
														
 
															+
														
 
															+That is:
														
 
															+
														
 
															+-   For every integer, there is a type representing literals with that integer
														
 
															+    value.
														
 
															+-   For every rational number, there is a type representing literals with that
														
 
															+    real value.
														
 
															+-   The types for real numbers are distinct from the types for integers, even
														
 
															+    for real numbers that represent integers. `var x: i32 = 1.0;` is invalid.
														
 
															+
														
 
															+Primitive operators are available between numeric literals, and produce values
														
 
															+with numeric literal types. For example, the type of `1 + 2` is the same as the
														
 
															+type of `3`.
														
 
															+
														
 
															+Numeric types can provide conversions to support initialization from numeric
														
 
															+literals. Because the value of the literal is carried in the type, a type-level
														
 
															+decision can be made as to whether the conversion is valid.
														
 
															+
														
 
															+The integer types defined in the standard library permit conversion from integer
														
 
															+literal types whose values are representable in the integer type. The
														
 
															+floating-point types defined in the Carbon library permit conversion from
														
 
															+integer and rational literal types whose values are between the minimum and
														
 
															+maximum finite value representable in the floating-point type.
														
 
															+
														
 
															+### Prelude support
														
 
															+
														
 
															+The following types are defined in the Carbon prelude:
														
 
															+
														
 
															+-   An arbitrary-precision integer type.
														
 
															+
														
 
															+    ```
														
 
															+    class BigInt;
														
 
															+    ```
														
 
															+
														
 
															+-   A rational type, parameterized by a type used for its numerator and
														
 
															+    denominator.
														
 
															+
														
 
															+    ```
														
 
															+    class Rational(T:! Type);
														
 
															+    ```
														
 
															+
														
 
															+    The exact constraints on `T` are not yet decided.
														
 
															+
														
 
															+-   A type representing integer literals.
														
 
															+
														
 
															+    ```
														
 
															+    class IntLiteral(N:! BigInt);
														
 
															+    ```
														
 
															+
														
 
															+-   A type representing floating-point literals.
														
 
															+
														
 
															+    ```
														
 
															+    class FloatLiteral(X:! Rational(BigInt));
														
 
															+    ```
														
 
															+
														
 
															+All of these types are usable during compilation. `BigInt` supports the same
														
 
															+operations as `Int(n)`. `Rational(T)` supports the same operations as
														
 
															+`Float(n)`.
														
 
															+
														
 
															+The types `IntLiteral(n)` and `FloatLiteral(x)` also support primitive integer
														
 
															+and floating-point operations such as arithmetic and comparison, but these
														
 
															+operations are typically heterogeneous: for example, an addition between
														
 
															+`IntLiteral(n)` and `IntLiteral(m)` produces a value of type
														
 
															+`IntLiteral(n + m)`.
														
 
															+
														
 
															+### Implicit conversions
														
 
															+
														
 
															+`IntLiteral(n)` converts to any sufficiently large integer type, as if by:
														
 
															+
														
 
															+```
														
 
															+impl [template N:! BigInt, template M:! BigInt]
														
 
															+    IntLiteral(N) as ImplicitAs(Int(M))
														
 
															+    if N >= Int(M).MinValue as BigInt and N <= Int(M).MaxValue as BigInt {
														
 
															+  ...
														
 
															+}
														
 
															+impl [template N:! BigInt, template M:! BigInt]
														
 
															+    IntLiteral(N) as ImplicitAs(Unsigned(M))
														
 
															+    if N >= Int(M).MinValue as BigInt and N <= Int(M).MaxValue as BigInt {
														
 
															+  ...
														
 
															+}
														
 
															+```
														
 
															+
														
 
															+The above is for exposition purposes only; various parts of this syntax are not
														
 
															+yet decided.
														
 
															+
														
 
															+Similarly, `IntLiteral(x)` and `FloatLiteral(x)` convert to any sufficiently
														
 
															+large floating-point type, and produce the nearest representable floating-point
														
 
															+value. Conversions in which `x` lies exactly half-way between two values are
														
 
															+rejected, as
														
 
															+[previously decided](/docs/design/lexical_conventions/numeric_literals.md#ties).
														
 
															+Conversions in which `x` is outside the range of finite values of the
														
 
															+floating-point type are also represented, rather than saturating to the finite
														
 
															+range or producing an infinity.
														
 
															+
														
 
															+### Examples
														
 
															+
														
 
															+```carbon
														
 
															+// This is OK: the initializer is of the integer literal type with value
														
 
															+// -2147483648 despite being written as a unary `-` applied to a literal.
														
 
															+var x: i32 = -2147483648;
														
 
															+
														
 
															+// This initializes y to 2^60.
														
 
															+var y: i64 = 1 << 60;
														
 
															+
														
 
															+// This forms a rational literal whose value is one third, and converts it to
														
 
															+// the nearest representable value of type `f64`.
														
 
															+var z: f64 = 1.0 / 3.0;
														
 
															+
														
 
															+// This is an error: 300 cannot be represented in type `i8`.
														
 
															+var c: i8 = 300;
														
 
															+
														
 
															+fn f[template T:! Type](v: T) {
														
 
															+  var x: i32 = v * 2;
														
 
															+}
														
 
															+
														
 
															+// OK: x = 2_000_000_000.
														
 
															+f(1_000_000_000);
														
 
															+
														
 
															+// Error: 4_000_000_000 can't be represented in type `i32`.
														
 
															+f(2_000_000_000);
														
 
															+
														
 
															+// No storage required for the bound when it's of integer literal type.
														
 
															+struct Span(template T:! Type, template BoundT:! Type) {
														
 
															+  var begin: T*;
														
 
															+  var bound: BoundT;
														
 
															+}
														
 
															+
														
 
															+// Returns 1, because 1.3 can implicitly convert to f32, even though conversion
														
 
															+// to f64 might be a more exact match.
														
 
															+fn G() -> i32 {
														
 
															+  match (1.3) {
														
 
															+    case _: f32 => { return 1; }
														
 
															+    case _: f64 => { return 2; }
														
 
															+  }
														
 
															+}
														
 
															+
														
 
															+// Can only be called with a literal 0.
														
 
															+fn PassMeZero(_: IntLiteral(0));
														
 
															+
														
 
															+// Can only be called with integer literals in the given range.
														
 
															+fn ConvertToByte[template N:! BigInt](_: IntLiteral(N)) -> i8
														
 
															+    if N >= -128 and N <= 127 {
														
 
															+  return N as i8;
														
 
															+}
														
 
															+
														
 
															+// Given any int literal, produces a literal whose value is one higher.
														
 
															+fn OneHigher(L: IntLiteral(template _:! BigInt)) -> auto {
														
 
															+  return L + 1;
														
 
															+}
														
 
															+// Error: 256 can't be represented in type `i8`.
														
 
															+var v: i8 = OneHigher(255);
														
 
															+```
														
 
															+
														
 
															+## Alternatives considered
														
 
															+
														
 
															+### Use an ordinary integer or floating-point type for literals
														
 
															+
														
 
															+We could decide on a fixed-width type based on the form of the literal, for
														
 
															+example using a type suffix with some rules to determine what type to pick for
														
 
															+unsuffixed literals.
														
 
															+
														
 
															+Advantages:
														
 
															+
														
 
															+-   This follows what C++ does.
														
 
															+-   Can determine the type of a floating-point number without requiring
														
 
															+    contextual information.
														
 
															+
														
 
															+Disadvantages:
														
 
															+
														
 
															+-   Surprising behavior when applying an operator to a literal would result in
														
 
															+    overflow. Even if we diagnose this, a diagnostic that `-2147483648` is
														
 
															+    invalid because it overflows is surprising.
														
 
															+-   Creates additional literal syntax that users will need to understand.
														
 
															+-   May select types that don't match the programmer's expectations.
														
 
															+-   Whatever types we pick are privileged.
														
 
															+
														
 
															+### Use same type for all literals
														
 
															+
														
 
															+We could give literals a single, arbitrary-precision type (say, `Integer` for
														
 
															+integer literals and `Rational` for real literals).
														
 
															+
														
 
															+Advantages:
														
 
															+
														
 
															+-   Only introduces two new types, not an unbounded parameterized family of
														
 
															+    types.
														
 
															+-   Writing a function that takes any integer literal can be done with more
														
 
															+    obvious syntax and less syntactic overhead. Instead of:
														
 
															+    ```
														
 
															+    fn OneHigher(L: IntLiteral(template _:! BigInt));
														
 
															+    ```
														
 
															+    we could write
														
 
															+    ```
														
 
															+    fn OneHigher(template L:! Integer);
														
 
															+    ```
														
 
															+    However, with this proposal, a function taking any integer expression that
														
 
															+    can be evaluated to a constant can be written as
														
 
															+    ```
														
 
															+    fn F(template N:! BigInt);
														
 
															+    ```
														
 
															+    and such a function would accept all integer literals, as well as
														
 
															+    non-literal constants.
														
 
															+
														
 
															+Disadvantages:
														
 
															+
														
 
															+-   Our mechanism for specifying the behavior of operations such as arithmetic
														
 
															+    is based on interface implementations, which are looked up by type.
														
 
															+    Supporting `impl` selection based on values would introduce substantial
														
 
															+    complexity.
														
 
															+-   If we introduce an arbitrary-precision integer type, it would be
														
 
															+    inconsistent to support it only during compilation. However, if we allow its
														
 
															+    use at runtime, programs may use it accidentally, with an invisible
														
 
															+    performance cost. For example, `var x: auto = 123;` would result in `x`
														
 
															+    having an infinite-precision type, possibly involving invisible dynamic
														
 
															+    allocation.
														
 
															+    -   Under this proposal, the type of `x` is a type that can only represent
														
 
															+        the value `123`; as such, `x` is effectively immutable. The
														
 
															+        arbitrary-precision integer type introduced in this proposal can only be
														
 
															+        used explicitly by programs naming it.
														
 
															+
														
 
															+### Allow leading `-` in literal tokens
														
 
															+
														
 
															+We could treat a leading `-` character as part of a numeric literal token, so
														
 
															+that -- for example -- `-123` would be a single `-123` token rather than a unary
														
 
															+negation applied to a literal `123`.
														
 
															+
														
 
															+Advantages:
														
 
															+
														
 
															+-   This would narrowly solve the problem that `INT_MIN` cannot be written
														
 
															+    directly, without any of the other implications of this proposal.
														
 
															+
														
 
															+Disadvantages:
														
 
															+
														
 
															+-   Makes the behavior of unary `-` less uniform.
														
 
															+-   Prevents the introduction of infix or postfix operators that bind more
														
 
															+    tightly than unary `-`, such as an infix exponentiation operator: `-2**2`
														
 
															+    may be expected to evaluate to -4, not to +4.