OpenAPI Tips
OpenAPI tips - Data types and formats
Tristan Cartledge
December 6, 2022
Announcing: OpenAPI Reference
Hi! These blog posts have been popular, so we’ve built an entire OpenAPI Reference Guide to answer any questions you have.
It includes detailed information on data types.
Happy Spec Writing!
The problem
The OpenAPI spec is best known for descriptions of RESTful APIs, but it’s designed to be capable of describing any HTTP API whether that be REST or something more akin to RPC based calls.
That leads to the spec having a lot of flexibility baked-in: there’s a lot of ways to achieve the exact same result that are equally valid in the eyes of the spec. Because of this, the OpenAPI (opens in a new tab) documentation is very ambiguous when it comes to how you should define your API.
That’s why we’re taking the time to eliminate some of the most common ambiguities that you’ll encounter when you build your OpenAPI spec. In this case we’ll be taking a look at how to effectively use data types in your OpenAPI 3.0.X spec.
Note: We will cover the differences introduced by 3.1 in a future post.
Recommended practices
The OpenAPI Spec gives you plenty of options for describing your types, but also a lot of options to describe them loosely. Loose is fine if your goal is to have a spec that is valid, but if you are using your OpenAPI document to generate: code, documentation or other artifacts, loose will get you into trouble.
Describe your types as accurately as possible; you will not only improve the documentation of your API reducing ambiguity for end-users), but will give as much information as possible to any tools you might be using to generate code, documentation or other artifacts from your OpenAPI document. Concretely, we recommend that you:
- Describe your types as explicitly as possible by using the OpenAPI defined formats.
- Use additional validation attributes as much as possible: mark properties as required, set readOnly/writeOnly, and indicate when fields that are nullable.
Below, we will step through the different types available in OpenAPI and explain how to use formats, patterns and additional attributes to give you a spec that is descriptive and explicit.
The data types
In addition to an object type, for custom type definitions, the OpenAPI Specification (opens in a new tab) supports most of the “primitive” types and objects you would expect to describe what your API is capable of sending and receiving:
For each of these primitive types, there is a set of commonly-used formats (i.e. date format for string) which you can designate to enforce additional constraints on the values of a schema or field. There is also the option of associating a nullable attribute. These options lead to a number of different possibilities for describing your data.
The OpenAPI Spec also includes the ability to describe more complex relationships between types using the oneOf/anyOf/allOf attributes and providing the ability to describe enums but we will leave the discussion of them to a future blog post.
For now, let’s explore the various types and options available for describing your types.
string
Of the primitive types (ignoring the object type) , the string type is the most flexible type available. In addition to being able to be used to represent other types (such as “true”
, “100”
, “{\\“some\\”: \\”object\\”}”
), it supports a number of formats that overlay constraints to the type of data represented. This is useful for mapping to types in various languages if you are using the OpenAPI spec for code generation.
Formats
The string type via the OpenAPI Specification officially supports the below formats:
Type | Format | Explanation | Example |
---|---|---|---|
string | date | An RFC3339 (opens in a new tab) formatted date string | “2022-01-30” |
string | date-time | An RFC3339 (opens in a new tab) formatted date-time string | “2019-10-12T07:20:50.52Z” |
string | password | Provides a hint that the string may contain sensitive information. | “mySecretWord1234” |
string | byte | Base-64 encoded data. | “U3BlYWtlYXN5IG1ha2VzIHdvcmtpbmcgd2l0aCBBUElzIGZ1biE=” |
string | binary | Binary data, used to represent the contents of a file. | “01010101110001” |
The format attribute can also be used to describe a number of other formats the string might represent but outside the official list above, those formats might not be supported by tooling that works with the OpenAPI Spec, meaning that they would be provided more as hints to end-users of the API:
- uuid
- uri
- hostname
- ipv4 & ipv6
- and others
Below are some examples of describing various string types:
# A basic stringschema:type: string# A string that represents a RFC3339 formatted date-time stringschema:type: stringformat: date-time# A string that represents a enum with the specified valuesschema:type: stringenum:- "one"- "two"- "three"# A string that represents a fileschema:type: stringformat: binary
Patterns
The string type also has an associated pattern attribute that can be provided to define a regular expression that should be matched by any string represented by that type. The format of the regular expression is based on Javascript (opens in a new tab) and therefore could describe regular expressions that might not be supported by various tools or target languages, so make sure to check the compatibility with your intended targets.
Example of a string defined with a regex pattern:
# A string that must match the specified patternschema:type: stringpattern: ^[a-zA-Z0-9_]*$
number/integer
The number/integer types allow the description of various number formats through a combination of the type and format attributes, along with a number of attributes for validating the data, the spec should cover most use cases.
Available formats are:
Type | Format | Explanation | Example |
---|---|---|---|
number | Any number integer/float at any precision. | 10 or 1.9 or 9223372036854775807 | |
number | float | 32-bit floating point number. | 1.9 |
number | double | 64-bit floating point number. | 1.7976931348623157 |
integer | Any integer number. | 2147483647 or 9223372036854775807 | |
integer | int32 | 32-bit integer. | 2147483647 |
integer | int64 | 64-bit integer. | 9223372036854775807 |
Below are some examples of defining number/integer types:
# Any numberschema:type: number# A 32-bit floating point numberschema:type: numberformat: float# A 64-bit floating point numberschema:type: numberformat: double# Any integerschema:type: integer# A 32-bit integerschema:type: integerformat: int32# A 64-bit integerschema:type: integerformat: int64
Various tools may treat a number/integer without a format attribute as a type capable of holding the closest representation of that number in the target language. For example, a number might be represented by a double, and an integer by an int64. Therefore, it’s recommended that you be explicit with the format of your number type and always populate the format attribute.
The number type also has some optional attributes for additional validation:
- minimum: The minimum inclusive number the value should contain.
- maximum: The maximum inclusive number the value should contain.
- exclusiveMinimum: Make the minimum number exclusive.
- exclusiveMaximum: Make the maximum number exclusive.
- multipleOf: Specify the number/integer is a multiple of the provided value.
Some examples are below:
# An integer with a minimum inclusive value of 0schema:type: integerformat: int32minimum: 10# An integer with a minimum exclusive value of 0schema:type: integerformat: int32minimum: 0exclusiveMinimum: true# A float with a range between 0 and 1schema:type: numberformat: floatminimum: 0maximum: 1# A double with an exclusive maximum of 100schema:type: numberformat: doublemaximum: 100exclusiveMaximum: true# An 64 but integer that must be a multiple of 5schema:type: integerformat: int64multipleOf: 5
boolean
The boolean type is simple; it represents either true or false. Be aware that it doesn’t support other truthy/falsy values like: 1 or 0, an empty string “” or null. It has no additional attributes to control its format or validation.
# A boolean typeschema:type: boolean
array
The array type provides a way of defining a list of other types through providing an items attribute that represents the schema of the type contained in the array.
# An array of stringschema:type: arrayitems:type: string# An array of objectsschema:type: arrayitems:type: objectproperties:name:type: stringage:type: integer# An array of arbitrary thingsschema:type: arrayitems: {}
The array type will support any schema that describes any other type in its items attribute including types using oneOf/anyOf/allOf attributes. The array type also has some optional attributes for additional validation:
- minItems: The minimum number of items the array must contain.
- maxItems: The maximum number of items the array must contain.
- uniqueItems: The array must contain only unique items.
# An array of floats that must contain at least 1 element.schema:type: arrayitems:type: numberformat: floatminItems: 1# An array of strings that must contain at most 10 elements.schema:type: arrayitems:type: stringmaxItems: 10# An array of booleans that must contain exactly 3 elements.schema:type: arrayitems:type: booleanminItems: 3maxItems: 3# An array of strings that must contain only unique elements.schema:type: arrayitems:type: stringuniqueItems: true
object
The object type allows simple and complex objects, dictionaries, and free-form objects, along with a number of attributes to control validation.
Fully typed object
Fully typed objects can be described by providing a properties attribute that lists each property of the object and its associated type.
# A fully typed objectschema:type: objectproperties:name:type: stringage:type: integerformat: int32active:type: boolean# A fully typed object with a nested objectschema:type: objectproperties:name:type: stringage:type: integerformat: int32active:type: booleanaddress:type: objectproperties:street:type: stringcity:type: stringstate:type: stringzip:type: string
Objects with properties have access to some additional attributes that allow the objects to be validated in various ways:
- required: A list of properties that are required. Specified at the object level.
- readOnly: A property that is only available in a response.
- writeOnly: A property that is only available in a request.
# A fully typed object with all fields requiredschema:type: objectproperties:name:type: stringage:type: integerformat: int32active:type: booleanrequired:- name- age- active# A fully typed object with only one field requiredschema:type: objectproperties:name:type: stringage:type: integerformat: int32active:type: booleanrequired:- name# A fully typed object with some field as read-onlyschema:type: objectproperties:name:type: stringage:type: integerformat: int32active:type: booleanreadOnly: true # This field is only returned in a responserequired:- name- age- active # This field will only be required in a response# A fully typed object with some field as write-onlyschema:type: objectproperties:name:type: stringage:type: integerformat: int32active:type: booleanisHuman:type: booleanwriteOnly: true # This field is only required in a requestrequired:- name- age- active- isHuman # This field will only be required in a request
Using object for dictionaries
The object type can also be used to describe dictionaries/maps/etc that use strings for keys and support any value type that can be described by the OpenAPI Spec.
# A dictionary of string valuesschema:type: objectadditionalProperties:type: string# A dictionary of objectsschema:type: objectadditionalProperties:type: objectproperties:name:type: stringage:type: integerformat: int32
You can also describe dictionaries that will contain certain keys
# A dictionary that must contain at least the specified keysschema:type: objectproperties:name:type: string # Must match type of additionalPropertiesrequired:- nameadditionalProperties:type: string
When using the additionalProperties attribute you can also specify additional attributes to validate the number of properties in the object:
- minProperties: The minimum number of properties allowed in the object.
- maxProperties: The maximum number of properties allowed in the object.
For example:
# A dictionary of string values that has at least one key.schema:type: objectadditionalProperties:type: stringminProperties: 1# A dictionary of string values that has at most 10 keys.schema:type: objectadditionalProperties:type: stringmaxProperties: 10# A dictionary of string values that has 1 key.schema:type: objectadditionalProperties:type: stringminProperties: 1maxProperties: 1
Free-form objects
The object type can also be used to describe any arbitrary key/value pair (where the keys are still required to be strings).
# An arbitrary object/dictionary that can contain any value.schema:type: objectadditionalProperties: true# An alternate way to specify an arbitrary object/dictionary that can contain any value.schema:type: objectadditionalProperties: {}
null
OpenAPI 3.0.X doesn’t support a null type but instead allows you to mark a schema as being nullable. This allows that type to either contain a valid value or null.
# A nullable stringschema:type: stringnullable: true# A nullable integerschema:type: integerformat: int32nullable: true# A nullable booleanschema:type: booleannullable: true# A nullable arrayschema:type: arrayitems:type: stringnullable: true# A nullable objectschema:type: objectproperties:foo:type: stringnullable: true