# Words ## Table of contents - [Overview](#overview) - [Keywords](#keywords) - [Alternatives considered](#alternatives-considered) - [References](#references) ## Overview A _word_ is a lexical element formed from a sequence of letters or letter-like characters, such as `fn` or `Foo` or `Int`. The exact lexical form of words has not yet been settled. However, Carbon will follow lexical conventions for identifiers based on [Unicode Annex #31](https://unicode.org/reports/tr31/). TODO: Update this once the precise rules are decided; see the [Unicode source files](/proposals/p0142.md#characters-in-identifiers-and-whitespace) proposal. Carbon source files, including comments and string literals, are required to be in Unicode Normalization Form C (NFC). ## Keywords The following words are interpreted as keywords: - `abstract` - `addr` - `alias` - `and` - `api` - `as` - `auto` - `base` - `break` - `case` - `class` - `constraint` - `continue` - `default` - `else` - `extends` - `external` - `final` - `fn` - `for` - `forall` - `friend` - `if` - `impl` - `impls` - `import` - `in` - `interface` - `let` - `library` - `like` - `match` - `namespace` - `not` - `observe` - `or` - `override` - `package` - `partial` - `private` - `protected` - `return` - `returned` - `Self` - `then` - `type` - `var` - `virtual` - `where` - `while` ## Alternatives considered - [Character encoding: We could restrict words to ASCII.](/proposals/p0142.md#character-encoding-1) - [Normalization form alternatives considered](/proposals/p0142.md#normalization-forms) ## References - Proposal [#142: Unicode source files](https://github.com/carbon-language/carbon-lang/pull/142)