15 SQL/JSON Path Expressions
Oracle Database provides SQL access to JSON data using SQL/JSON path expressions.
- Overview of SQL/JSON Path Expressions
Oracle Database provides SQL access to JSON data using SQL/JSON path expressions. - SQL/JSON Path Expression Syntax
SQL/JSON path expressions are matched by SQL/JSON functions and conditions against JSON data, to select portions of it. Path expressions can use wildcards and array ranges. Matching is case-sensitive. - SQL/JSON Path Expression Item Methods
The Oracle item methods available for a SQL/JSON path expression are described. - Types in Comparisons
Comparisons in SQL/JSON path-expression filter conditions are statically typed at compile time. If the effective types of the operands of a comparison are not known to be the same then an attempt is sometimes made to reconcile them by type-casting.
Parent topic: Query JSON Data
15.1 Overview of SQL/JSON Path Expressions
Oracle Database provides SQL access to JSON data using SQL/JSON path expressions.
JSON is a notation for JavaScript values. When JSON data is stored in the database you can query it using path expressions that are somewhat analogous to XQuery or XPath expressions for XML data. Similar to the way that SQL/XML allows SQL access to XML data using XQuery expressions, Oracle Database provides SQL access to JSON data using SQL/JSON path expressions.
SQL/JSON path expressions have a simple syntax. A path expression selects zero or more JSON values that match, or satisfy, it.
SQL/JSON condition json_exists
returns true if at least one value matches, and false if no value matches. If a single value matches, then SQL/JSON function json_value
returns that value if it is scalar and raises an error if it is non-scalar. If no value matches the path expression then json_value
returns SQL NULL
.
SQL/JSON function json_query
returns all of the matching values, that is, it can return multiple values. You can think of this behavior as returning a sequence of values, as in XQuery, or you can think of it as returning multiple values. (No user-visible sequence is manifested.)
In all cases, path-expression matching attempts to match each step of the path expression, in turn. If matching any step fails then no attempt is made to match the subsequent steps, and matching of the path expression fails. If matching each step succeeds then matching of the path expression succeeds.
The maximum length of a SQL/JSON path expression is 32K bytes.
Related Topics
Parent topic: SQL/JSON Path Expressions
15.2 SQL/JSON Path Expression Syntax
SQL/JSON path expressions are matched by SQL/JSON functions and conditions against JSON data, to select portions of it. Path expressions can use wildcards and array ranges. Matching is case-sensitive.
You pass a SQL/JSON path expression and some JSON data to a SQL/JSON function or condition. The path expression is matched against the data, and the matching data is processed by the particular SQL/JSON function or condition. You can think of this matching process in terms of the path expression returning the matched data to the function or condition.
- Basic SQL/JSON Path Expression Syntax
The basic syntax of a SQL/JSON path expression is presented. It is composed of a context-item symbol ($
) followed by zero or more object, array, and descendant steps, each of which can be followed by a filter expression, followed optionally by a function step. Examples are provided. - SQL/JSON Path Expression Syntax Relaxation
The basic SQL/JSON path-expression syntax is relaxed to allow implicit array wrapping and unwrapping. This means that you need not change a path expression in your code if your data evolves to replace a JSON value with an array of such values, or vice versa. Examples are provided.
Parent topic: SQL/JSON Path Expressions
15.2.1 Basic SQL/JSON Path Expression Syntax
The basic syntax of a SQL/JSON path expression is presented. It is
composed of a context-item symbol ($
) followed by zero or more object,
array, and descendant steps, each of which can be followed by a filter expression, followed
optionally by a function step. Examples are provided.
However, this basic syntax is extended by relaxing the matching of arrays and non-arrays against non-array and array patterns, respectively — see SQL/JSON Path Expression Syntax Relaxation.
Matching of data against SQL/JSON path expressions is case-sensitive.
-
A SQL/JSON basic path expression (also called just a path expression here) is an absolute path expression or a relative path expression.
-
An absolute path expression begins with a dollar sign (
$
), which represents the path-expression context item, that is, the JSON data to be matched. That data is the result of evaluating a SQL expression that is passed as argument to the SQL/JSON function. The dollar sign is followed by zero or more nonfunction steps, followed by an optional function step. -
A relative path expression is an at sign (
@
) followed by zero or more nonfunction steps, followed by an optional function step. It has the same syntax as an absolute path expression, except that it uses an at sign instead of a dollar sign ($
).A relative path expression is used inside a filter expression (filter, for short). The at sign represents the path-expression current filter item, that is, the JSON data that matches the part of the (surrounding) path expression that precedes the filter containing the relative path expression. A relative path expression is matched against the current filter item in the same way that an absolute path expression is matched against the context item.
-
A nonfunction step is an object step, an array step, or a descendant step, followed by an optional filter expression.
-
A single function step is optional in a basic path expression (absolute or a relative). If present, it is the last step of the path expression. It is a period (
.
), sometimes read as "dot", followed by a SQL/JSON item method, followed by a left parenthesis ((
) and then a right parenthesis ()
). The parentheses can have whitespace between them (such whitespace is insignificant).The item method is applied to the data that is targeted by the rest of the same path expression, which precedes the function step. The item method is used to transform that data. The SQL function or condition that is passed the path expression as argument uses the transformed data in place of the targeted data.
-
An object step is a period (
.
), followed by an object field name or an asterisk (*
) wildcard, which stands for (the values of) all fields. A field name can be empty, in which case it must be written as""
(no intervening whitespace). A nonempty field name must start with an uppercase or lowercase letter A to Z and contain only such letters or decimal digits (0-9), or else it must be enclosed in double quotation marks ("
).An object step returns the value of the field that is specified. If a wildcard is used for the field then the step returns the values of all fields, in no special order.
-
An array step is a left bracket (
[
) followed by either an asterisk (*
) wildcard, which stands for all array elements, or one or more specific array indexes or range specifications separated by commas (,
), followed by a right bracket (]
).An error is raised if you use both an asterisk and either an array index or a range specification. And an error is raised if no index or range specification is provided:
[]
is not a valid array step.An array index specifies a single array position, which is a whole number (0, 1, 2,...). An array index can thus be a literal whole number:
0
,1
,2
,… Array position and indexing are zero-based, as in the JavaScript convention for arrays: the first array element has index0
(specifying position 0).The last element of a nonempty array of any size can be referenced using the index
last
.An array index can also have the form
last - N
, where-
is a minus sign (hyphen) andN
is a literal whole number (0
,1
,2
,…) that is no greater than the array size minus 1.The next-to-last array element can be referenced using index
last-1
, the second-to-last by indexlast-2
, and so on. Whitespace surrounding the minus sign (hyphen) is ignored.For the array
["a", "b", 42]
, for example, the element at index1
(position 1) is the string"b"
— the second array element. The element at index2
, or indexlast
, is the number42
. The element at index0
, orlast-2
, is"a"
.For Oracle SQL function
json_transform
, you can also use an index of the formlast + N
, whereN
is a whole number. This lets you append new elements to an existing array, by specifying positions beyond the current array size minus 1. Whitespace surrounding the plus sign is ignored. You cannot use an index of this form in combination with other indexes, including in a range specification (see next). An error is raised in that case.A range specification has the form
N
toM
, whereN
andM
are array indexes, and whereto
is preceded and followed by one or more whitespace characters.Foot 1Range specification
N
toM
is equivalent to explicitly specifying all of the indexes fromN
toM
, includingN
andM
, in ascending order.In a range specification, the order of
N
andM
is not significant; the range of the third through sixth elements can be written as2 to 5
or5 to 2
. For a six-element array the same range can be written as2 to last
orlast to 2
. The range specificationN to N
(same indexN
on each side ofto
) is equivalent to the single indexN
(it is not equivalent to[N, N]
).The order in which array indexes and ranges are specified in an array step is significant; it is reflected in the array that results from the function that uses the path expression.
Multiple range specifications in the same array step are treated independently. In particular, overlapping ranges result in repetition of the elements in the overlap.
For example, suppose that you query using SQL/JSON function
json_query
with array wrapper (which wraps multiple query results to return a single JSON array), passing it a path expression with this array step:[3 to 1, 2 to 4, last-1 to last-2, 0, 0]
. The data returned by the query will include an array that is made from these elements of an array in your queried data, in order:-
second through fourth elements (range
3 to 1
) -
third through fifth elements (range
2 to 4
) -
second-from-last through next-to-last elements (range
last-1 to last-2
) -
first element (index
0
) -
first element again (index
0
)
When matching the array
["1", "2", "3", "4", "5", "6", "7", "8", "9"]
in your data, the array in the query result would be["2", "3", "4", "3", "4", "5", "7", "8", "1", "1"]
.If you use array indexes that specify positions outside the bounds (0 through the array size minus 1) of an array in your data, no error is raised. The specified path expression simply does not match the data — the array has no such position. (Matching of SQL/JSON path expressions follows this rule generally, not just for array steps.)
This is the case, for example, if you try to match an index of
last-6
against an array with fewer than 7 elements. For an array of 6 elements,last
is5
, solast-6
specifies an invalid position (less than 0).It is also the case if you try to match any array step against an empty array. For example, array steps
[0]
and[last]
both result in no match against the data array[]
. Step[0]
doesn't match because[]
has no first element, and step[last]
doesn't match because[]
has no element with index-1
(array length minus 1).It is also the case, in particular, if you use an index
last+N
(N
non-zero) other than with functionjson_transform
. Forjson_transform
this is used not to match an existing array element but to specify where, when modifying an existing array, to insert a new element.Because a range specification is equivalent to an explicit, ascending sequence of array indexes, any of those implicit indexes which are out of bounds cannot match any data. Like explicit indexes, they are ignored.
Another way to think of this is that range specifications are, in effect, truncated to the nearest bound (
0
orlast
) for a given data array. For example when matching the array["a", "b", "c"]
, the range specificationslast-3 to 1
,2 to last+1
, andlast-3 to last+1
are, in effect, truncated to0 to 1
,2 to 2
, and0 to 2
, respectively. The (implicit) out-of-bounds indexes for those ranges,last-3
(which is-1
, here) andlast+1
(which is3
, here), are ignored. -
-
A descendant step is two consecutive periods (
..
), sometimes read as "dot dot", followed by a field name (which has the same syntax as for an object step).It descends recursively into the objects or arrays that match the step immediately preceding it (or into the context item if there is no preceding step).
At each descendant level, for each object and for each array element that is an object, it gathers the values of all fields that have the specified name. It returns all of the gathered field values.
For example, consider this query and data:
json_query(some_json_column, '$.a..z' WITH ARRAY WRAPPER)
{ "a" : { "b" : { "z" : 1 }, "c" : [ 5, { "z" : 2 } ], "z" : 3 } "z" : 4 }
The query returns an array, such as
[1,2,3]
, whose elements are1
,2
, and3
. It gathers the value of each fieldz
within the step that immediately precedes the dot dot (..
), which is fielda
. The topmost fieldz
, with value 4, is not matched because it is not within the value of fielda
.The value of field
a
is an object, which is descended into.-
It has a field
z
, whose value (3
) is gathered. It also has a fieldb
whose value is an object, which is descended into to gather the value of its fieldz
, which is1
. -
It also has a field
c
whose value is an array, which has an element that is an object with a fieldz
, whose value (2
) is gathered.
The JSON values gathered are thus
3
,1
, and2
. They are wrapped in an array, in an undefined order. One of the possible return values is[1,2,3]
. -
-
A filter expression (filter, for short) is a question mark (
?
) followed by a filter condition enclosed in parentheses (()
). A filter is satisfied if its condition is satisfied, that is, returns true. -
A filter condition applies a predicate (Boolean function) to its arguments and is one of the following, where each of
cond
,cond1
, andcond2
stands for a filter condition.-
( cond )
: Parentheses are used for grouping, separating filter conditioncond
as a unit from other filter conditions that may precede or follow it. -
cond1 && cond2
: The conjunction (and) ofcond1
andcond2
, requiring that both be satisfied. -
cond1 || cond2
: The inclusive disjunction (or) ofcond1
andcond2
, requiring thatcond1
,cond2
, or both, be satisfied. -
! ( cond )
: The negation ofcond
, meaning thatcond
must not be satisfied. -
exists (
, followed by a relative path expression, followed by)
: The condition that the targeted data exists (is present). -
A comparison, which is one of the following:
-
A relative path expression, followed by a comparison predicate, followed by either a JSON scalar value or a SQL/JSON variable.
-
Either a JSON scalar value or a SQL/JSON variable, followed by a comparison predicate, followed by a relative path expression.
-
A JSON scalar value, followed by a comparison predicate, followed by another JSON scalar value.
-
A relative path expression, followed by
has substring
,starts with
,like
,like_regex
, oreq_regex
, followed by either a JSON string or a SQL/JSON variable that is bound to a SQL string (which is automatically converted from the database character set to UTF8).-
has substring
means that the matching data value has the specified string as a substring. -
starts with
means that the matching data value has the specified string as a prefix. -
like
means that the JSON string data value matches the specified string, which is interpreted as a SQLLIKE
pattern that uses SQLLIKE4
character-set semantics. A percent sign (%
) in the pattern matches zero or more characters. An underscore (_
) matches a single character.Note:
Unlike the case for SQL
LIKE
, you cannot choose the escape character for path-expression predicatelike
— it is always character`
, GRAVE ACCENT (U+0060), also known sometimes as backquote or backtick.In database releases prior to 21c there is no escape character for path-expression predicate
like
. For such releases Oracle recommends that you avoid using character`
, GRAVE ACCENT (U+0060) inlike
patterns. -
like_regex
means that the JSON string data value matches the specified string, which is interpreted as a SQLREGEXP LIKE
regular expression pattern that uses SQLLIKE4
character-set semantics.like_regex
is exceptional among the pattern-matching comparisons, in that its pattern matches the empty JSON string (""
). -
eq_regex
is just likelike_regex
, except for these two differences:-
eq_regex
matches its regular expression pattern against the entire JSON string data value — the full string must match the pattern for the comparison to be satisfied.like_regex
is satisfied if any portion of the JSON string matches the pattern. -
The
eq_regex
pattern does not match the empty JSON string (""
).
-
For all of these predicates, a pattern that is the empty string (
""
) matches data that is the empty string. And for all exceptlike_regex
, a pattern that is a nonempty string does not match data that is the empty string. Forlike_regex
a nonempty pattern does match empty-string data. -
-
A relative path expression, followed by
in
, followed by a value list, meaning that the value is one of those in the value list.
A comparison predicate is
==
,<>
,!=
Foot 2,<
,<=
,>=
, or>
, meaning equals, does not equal, is less than, is less than or equal to, is greater than or equal to, and is greater than, respectively.A SQL/JSON variable is a dollar sign (
$
) followed by the name of a SQL identifier that is bound in aPASSING
clause forjson_exists
. -
- A value list is
(
, followed by a list of one or more scalar values and SQL/JSON variables separated by commas (,
), followed by)
.
The predicates that you can use in filter conditions are thus
&&
,||
,!
,exists
,==
,<>
,!=
,<
,<=
,>=
,>
, andin
.As an example, the filter condition
(a || b) && (!(c) || d < 42)
is satisfied if both of the following criteria are met:-
At least one of the filter conditions
a
andb
is satisfied:(a || b)
. -
Filter condition
c
is not satisfied or the numberd
is less than or equal to 42, or both are true:(!(c) || d < 42)
.
Comparison predicate
!
has precedence over&&
, which has precedence over||
. You can always use parentheses to control grouping.Without parentheses for grouping, the preceding example would be
a || b && !(c) || d < 42
, which would be satisfied if at least one of the following criteria is met:-
Condition
b && !(c)
is satisfied, which means that each of the conditionsb
and!(c)
is satisfied (which in turn means that conditionc
is not satisfied). -
Condition
a
is satisfied. -
Condition
d < 42
is satisfied.
-
At least
one side of a comparison must not be a SQL/JSON variable. The default
type for a comparison is defined at compile time, based on the type(s)
for the non-variable side(s). You can use a type-specifying item method to
override this default with a different type. The type of your matching data is
automatically converted, for the comparison, to fit the determined type (default or
specified by item method). For example, $.a > 5
imposes
numerical comparison because 5
is a number, $.a >
"5"
imposes string comparison because "5"
is a
string.
Note:
Oracle SQL function json_textcontains
provides powerful
full-text search of JSON data. If you need only simple string pattern-matching then
you can instead use a path-expression filter condition with any of these
pattern-matching comparisons: has substring
, starts
with
, like
, like_regex
, or
eq_regex
.
Here are some examples of path expressions, with their meanings spelled out in detail.
-
$
– The context item. -
$.friends
– The value of fieldfriends
of a context-item object. The dot (.
) immediately after the dollar sign ($
) indicates that the context item is a JSON object. -
$.friends[0]
– An object that is the first element of an array that is the value of fieldfriends
of a context-item object. The bracket notation indicates that the value of fieldfriends
is an array. -
$.friends[0].name
– Value of fieldname
of an object that is the first element of an array that is the value of fieldfriends
of a context-item object. The second dot (.
) indicates that the first element of arrayfriends
is an object (with aname
field). -
$.friends[*].name
– Value of fieldname
of each object in an array that is the value of fieldfriends
of a context-item object. -
$.*[*].name
– Fieldname
values for each object in an array value of a field of a context-item object. -
$.friends[3, 8 to 10, 12]
– The fourth, ninth through eleventh, and thirteenth elements of an arrayfriends
(field of a context-item object). The elements are returned in the order in which they are specified: fourth, ninth, tenth, eleventh, thirteenth.If an array to be matched has fewer than 13 elements then there is no match for index
12
. If an array to be matched has only 10 elements then, in addition to not matching index12
, the range8 to 10
is in effect truncated to positions 8 and 9 (elements 9 and 10). -
$.friends[12, 3, 10 to 8, 12]
– The thirteenth, fourth, ninth through eleventh, and thirteenth elements of arrayfriends
, in that order. The elements are returned in the order in which they are specified. The range10 to 8
specifies the same elements, in the same order, as the range8 to 10
. The thirteenth element (at position 12) is returned twice. -
$.friends[last-1, last, last, last]
– The next-to-last, last, last, and last elements of arrayfriends
, in that order. The last element is returned three times. -
$.friends[last to last-1, last, last]
– Same as the previous example. Rangelast to last-1
, which is the same as rangelast-1 to last
, returns the next-to-last through the last elements. -
$.friends[3].cars
– The value of fieldcars
of an object that is the fourth element of an arrayfriends
. The dot (.
) indicates that the fourth element is an object (with acars
field). -
$.friends[3].*
– The values of all of the fields of an object that is the fourth element of an arrayfriends
. -
$.friends[3].cars[0].year
– The value of fieldyear
of an object that is the first element of an array that is the value of fieldcars
of an object that is the fourth element of an arrayfriends
. -
$.friends[3].cars[0]?(@.year > 2016)
– The first object of an arraycars
(field of an object that is the fourth element of an arrayfriends
), provided that the value of its fieldyear
is, or can be converted to, a number greater than 2016. Ayear
value such as"2017"
is converted to the number2017
, which satisfies the test. Ayear
value such as"recent"
fails the test — no match. -
$.friends[3].cars[0]?(@.year.number() > 2016)
– Same as the previous. Item methodnumber()
allows only a number or a string value that can be converted to a number, and that behavior is already provided by numeric comparison predicate>
. -
$.friends[3].cars[0]?(@.year.numberOnly() > 2016)
– Same as the previous, but only if theyear
value is a number. Item methodnumberOnly()
excludes a car with ayear
value that is a string numeral, such as"2017"
. -
$.friends[3]?(@.addresses.city == "San Francisco")
– An object that is the fourth element of an arrayfriends
, provided that it has anaddresses
field whose value is an object with a fieldcity
whose value is the string"San Francisco"
. -
$.friends[*].addresses?(@city starts with "San ").zip
– Zip codes of alladdresses
offriends
, where the name of the addresscity
starts with "San ". (In this case the filter is not the last path step.) -
$..zip
– All values of azip
field, anywhere, at any level. -
$.friends[3]?(@.addresses.city == "San Francisco" && @.addresses.state == "Nevada")
– Objects that are the fourth element of an arrayfriends
, provided that there is a match for an address with acity
of"San Francisco"
and there is a match for an address with astate
of"Nevada"
.Note: The filter conditions in the conjunction do not necessarily apply to the same object — the filter tests for the existence of an object with city San Francisco and for the existence of an object with state Nevada. It does not test for the existence of an object with both city San Francisco and state Nevada. See Using Filters with JSON_EXISTS.
-
$.friends[3].addresses?(@.city == "San Francisco" && @.state == "Nevada")
– An object that is the fourth element of arrayfriends
, provided that object has a match forcity
of"San Francisco"
and a match forstate
of"Nevada"
.Unlike the preceding example, in this case the filter conditions in the conjunction, for fields
city
andstate
, apply to the sameaddresses
object. The filter applies to a givenaddresses
object, which is outside it.
Related Topics
- Using Filters with JSON_EXISTS
- RETURNING Clause for SQL Query Functions
- SQL/JSON Path Expression Item Methods
- SQL/JSON Path Expression Syntax Relaxation
- Diagrams for Basic SQL/JSON Path Expression Syntax
- Wrapper Clause for SQL/JSON Query Functions JSON_QUERY and JSON_TABLE
- ISO 8601 Date, Time, and Duration Support
See Also:
-
ISO 8601 for information about the ISO date formats
-
Oracle Database SQL Language Reference for information about SQL condition
REGEXP LIKE
-
Oracle Database SQL Language Reference for information about SQL condition
LIKE
andLIKE4
character-set semantics
Parent topic: SQL/JSON Path Expression Syntax
15.2.2 SQL/JSON Path Expression Syntax Relaxation
The basic SQL/JSON path-expression syntax is relaxed to allow implicit array wrapping and unwrapping. This means that you need not change a path expression in your code if your data evolves to replace a JSON value with an array of such values, or vice versa. Examples are provided.
Basic SQL/JSON Path Expression Syntax defines the basic SQL/JSON path-expression syntax. The actual path expression syntax supported relaxes that definition as follows:
-
If a path-expression step targets (expects) an array but the actual data presents no array then the data is implicitly wrapped in an array.
-
If a path-expression step targets (expects) a non-array but the actual data presents an array then the array is implicitly unwrapped.
This relaxation allows for the following abbreviation:
[*]
can be elided whenever it precedes the object
accessor, .
, followed by an object field name, with no change in
effect. The reverse is also true: [*]
can always be inserted in
front of the object accessor, .
, with no change in effect.
This means that the object step
[*].
prop
, which stands for the value of
field prop
of each element of a given array of objects, can be
abbreviated as .prop
, and the object step
.prop
, which looks as though it stands for the
prop
value of a single object, stands also for the
prop
value of each element of an array to which the
object accessor is applied.
This is an important feature, because it means that you need not change a path expression in your code if your data evolves to replace a given JSON value with an array of such values, or vice versa.
For example, if your data originally contains objects that have field
Phone
whose value is a single object with fields
type
and number
, the path expression
$.Phone.number
, which matches a single phone number, can still be
used if the data evolves to represent an array of phones. Path expression
$.Phone.number
matches either a single phone object, selecting its
number, or an array of phone objects, selecting the number of each.
Similarly, if your data mixes both kinds of representation — there are some data entries that use a single phone object and some that use an array of phone objects, or even some entries that use both — you can use the same path expression to access the phone information from these different kinds of entry.
Here are some example path expressions from section Basic SQL/JSON Path Expression Syntax, together with an explanation of equivalences.
-
$.friends
– The value of fieldfriends
of either:-
The (single) context-item object.
-
(equivalent to
$[*].friends
) Each object in the context-item array.
-
-
$.friends[0].name
– Value of fieldname
for any of these objects:-
The first element of the array that is the value of field
friends
of the context-item object. -
(equivalent to
$.friends.name
) The value of fieldfriends
of the context-item object. -
(equivalent to
$[*].friends.name
) The value of fieldfriends
of each object in the context-item array. -
(equivalent to
$[*].friends[0].name
) The first element of each array that is the value of fieldfriends
of each object in the context-item array.
The context item can be an object or an array of objects. In the latter case, each object in the array is matched for a field
friends
.The value of field
friends
can be an object or an array of objects. In the latter case, the first object in the array is used. -
-
$.*[*].name
– Value of fieldname
for any of these objects:-
An element of an array value of a field of the context-item object.
-
(equivalent to
$.*.name
) The value of a field of the context-item object. -
(equivalent to
$[*].*.name
) The value of a field of an object in the context-item array. -
(equivalent to
$[*].*[*].name
) Each object in an array value of a field of an object in the context-item array.
-
Related Topics
Parent topic: SQL/JSON Path Expression Syntax
15.3 SQL/JSON Path Expression Item Methods
The Oracle item methods available for a SQL/JSON path expression are described.
An item method is applied to the JSON data that is targeted by (the rest of) the path expression that is terminated by that method. The method is used to transform that data. The SQL function or condition that is passed the path expression uses the transformed data in place of the targeted data. In some cases the application of an item method acts as a filter, removing the targeted data from the result set.
If an item-method conversion fails for any reason, such as its argument being of the wrong type, then the path cannot be matched (it refers to no data), and no error is raised. In particular, this means that such an error is not handled by an error clause in the SQL function or condition to which the path expression is passed.
An item method always transforms the targeted
JSON data to (possibly other) JSON data. But a query using a path expression
(with or without an item method) can return data as a SQL data type that does not
support JSON data. That is the case for a json_value
query or an
equivalent dot-notation query.
-
The return value of SQL/JSON function
json_query
(or ajson_table
column expression that hasjson_query
semantics) is always JSON data, of SQL data typeJSON
,VARCHAR2
,CLOB
, orBLOB
. The default return data type isJSON
if the targeted data is also ofJSON
type. Otherwise, it isVARCHAR2
. -
The return value of SQL/JSON function
json_value
(or ajson_table
column expression that hasjson_value
semantics) is always of a SQL data type other thanJSON
type: a scalar type, an object type, or a collection type; it does not return JSON data. Though the path expression targets JSON data and an item method transforms targeted JSON data to JSON data,json_value
converts the resulting JSON data to a scalar SQL value in a data type that does not necessarily support JSON data. -
A dot-notation query with an item method implicitly applies
json_value
with aRETURNING
clause that specifies a scalar SQL type to the JSON data that is targeted and possibly transformed by an item method. Thus, a dot-notation query with an item method always returns a SQL scalar value.
Application of an Item Method to an Array
With the exception of item methods
count()
, size()
and type()
,
if an array is targeted by an item method then the method is applied to each of
the array elements, not to the array itself. The results of these
applications are returned in place of the array, as multiple values. That is, the
resulting set of matches includes the converted array elements, not the targeted
array.
(This is similar, in its effect, to the implied unwrapping of an array when a non-array is expected for an object step.)
For example, $.a.method()
applies item-method
method()
to each element of array
a
, to convert that element and use it in place of the array.
-
For a
json_value
query that specifies a SQL collection type (varray or nested table) as the return type, an instance of that collection type is returned, corresponding to the JSON array that results from applying the item method to each of the array elements, unless there is a type mismatch with respect to the collection type definition. -
For a
json_value
query that returns any other SQL type, SQLNULL
is returned. This is because mapping the item method over the array elements results in multiple return values, and that represents a mismatch forjson_value
. -
For
json_query
or ajson_table
column expression withjson_query
semantics, you can use a wrapper clause to capture all of the converted array-element values as an array. For example, this query:SELECT json_query('["alpha", 42, "10.4"]', '$[*].stringOnly()' WITH ARRAY WRAPPER) FROM dual;
returns this JSON array:
["alpha", "10.4"]
. The SQL data type returned is the same as the JSON data that was targeted:JSON
,VARCHAR2(4000)
,CLOB
, orBLOB
.
Item methods count()
,
size()
and type()
are exceptional in
this regard. When applied to an array they treat it as such, instead of acting on
its elements. For
example:
SELECT json_value('[19, "Oracle", {"a":1},[1,2,3]]', '$.type()')
FROM dual;
returns the single VARCHAR2
value
'array'
— json_value
returns
VARCHAR2(4000)
by default.
A similar query,
but with json_query
instead of json_value
,
returns the single JSON string "array"
, of whatever SQL data
type is used for the input JSON data: JSON
,
VARCHAR2(4000)
, CLOB
, or
BLOB
.
Note:
The same thing that happens for json_value
(with a
SQL return type other than an object or collection type) happens for a simple
dot notation query. The presence of an item method in dot notation
syntax always results in json_value
, not
json_query
, semantics. This must produce a single scalar
SQL value (which can be used with SQL ORDER BY
, GROUP
BY
, and comparisons or join operations). But an item method applied
to an array value results in multiple values, which json_value
semantics rejects — SQL NULL
is returned.
Item-Method Descriptions
The following item methods are data-type conversion methods:
binary()
, boolean()
,
booleanOnly()
, date()
,
dateWithTime()
, number()
,
numberOnly()
, double()
,
dsInterval()
, float()
,
number()
, numberOnly()
,
string()
, stringOnly()
,
timestamp()
, and ymInterval()
.
A targeted JSON value targeted by a data-type conversion item method is
said to be interpreted as a value of a given SQL data type. This means that,
in a query that has json_value
semantics, it is handled as if it
were controlled by a RETURNING
clause with that SQL data type.
For example, item-method string()
interprets its
target as would json_value
with clause RETURNING
VARCHAR2(4000)
. A Boolean value is thus treated by
string()
as "true"
or
"false"
; a null
value is treated as
"null"
; and a number is represented in a canonical string
form.
The data-type conversion methods with “only” in their name are the same as the
corresponding methods with names without “only”, except that the former convert
only JSON values that are of the given type (e.g.,
number
) to the related SQL data type (e.g.
NUMBER
). The methods without “only” in the name allow
conversion, when possible, of any JSON value to the given SQL data type.
(When an “only” method targets an array, the conversion applies to each array
element, as usual.)
-
abs()
: The absolute value of the targeted JSON number. Corresponds to the use of SQL functionABS
. -
avg()
: The average of all targeted JSON numbers. Item methodnumber()
is first applied implicitly to each of the possibly multiple values. Their average (a singleNUMBER
value) is then returned. Targeted JSON values that cannot not be converted to numbers are ignored. -
binary()
: A SQLRAW
interpretation of the targeted JSON value. Only JSON data stored asJSON
type matches. -
boolean()
: A SQLVARCHAR2(20)
interpretation of the targeted JSON value. -
booleanOnly()
: A SQLVARCHAR2(20)
interpretation of the targeted JSON data, but only if it is a JSON Boolean value; otherwise, there is no match. Acts as a filter, allowing matches only for JSON Boolean values. -
ceiling()
: The targeted JSON number, rounded up to the nearest integer. Corresponds to the use of SQL functionCEIL
. -
count()
: The number of targeted JSON values, regardless of their types. -
date()
: A SQLDATE
interpretation of the targeted JSON string. The targeted string data must be in a supported ISO 8601 format for a date or a date with time; otherwise, there is no match. If the JSON string has an ISO 8601 date-with-time format then the SQLDATE
instance has its time component truncated (set to zero). -
dateWithTime()
: Likedate()
, except that the time component of an ISO 8601 date-with-time format is preserved in the SQLDATE
instance. -
double()
: A SQLBINARY_DOUBLE
interpretation of the targeted JSON string or number. dsInterval()
: A SQLINTERVAL DAY TO SECOND
interpretation of the targeted JSON string. The targeted string data must be in one of the supported ISO 8601 duration formats; otherwise, there is no match.-
float()
: A SQLBINARY_FLOAT
interpretation of the targeted JSON string or number. Only JSON data stored asJSON
type matches. -
floor()
: The targeted JSON number, rounded down to the nearest integer. Corresponds to the use of SQL functionFLOOR
. -
length()
: The number of characters in the targeted JSON string, interpreted as a SQLNUMBER
. -
lower()
: The lowercase string that corresponds to the characters in the targeted JSON string. -
maxNumber()
: The maximum of all targeted JSON numbers. Item methodnumber()
is first applied implicitly to each of the possibly multiple values. Their maximum (a singleNUMBER
value) is then returned. Targeted JSON values that cannot be converted to numbers are ignored. -
maxString()
: The greatest of all targeted JSON strings, using collation order. Item methodstring()
is first applied implicitly to each of the possibly multiple values. The greatest of these (a singleVARCHAR2
value) is then returned. Targeted JSON values that cannot be converted to strings are ignored. -
minNumber()
: The minimum of all targeted JSON numbers. Item methodnumber()
is first applied implicitly to each of the possibly multiple values. Their minimum (a singleNUMBER
value) is then returned. Targeted JSON values that cannot be converted to numbers are ignored. -
minString()
: The least of all targeted JSON strings, using collation order. Item methodstring()
is first applied implicitly to each of the possibly multiple values. The least of these (a singleVARCHAR2
value) is then returned. Targeted JSON values that cannot be converted to strings are ignored. -
number()
: A SQLNUMBER
interpretation of the targeted JSON string or number. -
numberOnly()
: A SQLNUMBER
interpretation of the targeted JSON data, but only if it is a JSON number; otherwise, there is no match. Acts as a filter, allowing matches only for JSON numbers. -
size()
: If multiple JSON values are targeted then the result of applyingsize()
to each targeted value. Otherwise:-
If the single targeted value is a scalar then 1.
-
If the single targeted value is an array then the number of array elements.
-
If the single targeted value is an object then 1.
This item method can be used with
json_query
, in addition tojson_value
andjson_table
. If applied to data that is an array, no implicit iteration over the array elements occurs: the resulting value is just the number of array elements. (This is an exception to the rule of implicit iteration.) -
-
string()
: A SQLVARCHAR2(4000)
interpretation of the targeted scalar JSON value. -
stringOnly()
: A SQLVARCHAR2(4000)
interpretation of the targeted scalar JSON value, but only if it is a JSON string; otherwise, there is no match. Acts as a filter, allowing matches only for JSON strings. -
sum()
: The sum of all targeted JSON numbers. Item methodnumber()
is first applied implicitly to each of the possibly multiple values. Their sum (a singleNUMBER
value) is then returned. Targeted JSON values that cannot be converted to numbers are ignored. -
timestamp()
: A SQLTIMESTAMP
interpretation of the targeted JSON string. The targeted string data must be in a supported ISO 8601 format for a date or a date with time; otherwise, there is no match. Foot 3 -
type()
: The name of the JSON data type of the targeted data, interpreted as a SQLVARCHAR2(20)
value. This item method can be used withjson_query
, in addition tojson_value
andjson_table
. If applied to data that is an array, no implicit iteration over the array elements occurs: the resulting value is"array"
. (This is an exception to the rule of implicit iteration.)-
"null"
for a value ofnull
. -
"boolean"
for a Boolean value (true
orfalse
). -
"number"
for a number. -
"string"
for a string. -
"array"
for an array. -
"object"
for an object. -
"double"
for a number that corresponds to a SQLBINARY_DOUBLE
value. (ForJSON
type data only.) -
"float"
for a number that corresponds to a SQLBINARY_FLOAT
value. (ForJSON
type data only.) -
"binary"
for a value corresponds to a SQLRAW
value. (ForJSON
type data only.) -
"date"
for a value corresponds to a SQLDATE
value. (ForJSON
type data only.) -
"timestamp"
for a value corresponds to a SQLTIMESTAMP
value. (ForJSON
type data only.) -
"daysecondInterval"
for a value corresponds to a SQLINTERVAL DAY TO SECOND
value. (ForJSON
type data only.) -
"yearmonthInterval"
for a value corresponds to a SQLINTERVAL YEAR TO MONTH
value. (ForJSON
type data only.)
-
-
upper()
: The uppercase string that corresponds to the characters in the targeted JSON string. ymInterval()
: A SQLINTERVAL YEAR TO MONTH
interpretation of the targeted JSON string. The targeted string data must be in one of the supported ISO 8601 duration formats; otherwise, there is no match.
Item methods binary()
, boolean()
,
booleanOnly()
, date()
,
dateWithTime()
, dsInterval()
,
float()
, length()
, lower()
,
number()
, numberOnly()
,
string()
, stringOnly()
,
timestamp()
, upper()
, and
ymInterval()
are Oracle extensions to the SQL/JSON
standard. The other item methods, abs()
,
ceiling()
, double()
, floor()
,
size()
, and type()
are part of the
standard.
Item methods avg()
, count()
,
maxNumber()
, minNumber()
,
maxString()
, minString()
, and
sum()
are aggregate item methods. Instead of acting
individually on each targeted value they act on all targeted values together.
For example, if a path expression targets multiple values that can be converted to
numbers then sum()
returns the sum of those numbers.
Note that when a path
expression targets an array, applying an aggregate item method to it, the
array is handled as a single value — there is no implicit iteration over the
array elements. For example, count()
counts any targeted array as
one value, and size()
returns the size of the array, not the sizes
of its elements.
If you want an aggregate item method to act on
the array elements then you need to explicitly iterate over those elements, using
wildcard *
. For example, if the value of field
LineItems
in a given document is an array then
$.LineItems.count()
returns 1
, but
$.LineItems[*].count()
returns the number of array
elements.
An aggregate item method applies to a single JSON document at a time, just like the path expression (or dot-notation) of which it is part. It aggregates the multiple values that the path expression targets in that document. In a query it returns a row for each document. It does not aggregate information across multiple documents, returning a single row for all documents, as do SQL aggregate functions. See Example 15-1 and Example 15-2.
Item Methods and JSON_VALUE RETURNING Clause
Because some item methods
interpret the targeted JSON data as if it were of a SQL data type, they can be used
with json_value
in place of a RETURNING
clause,
and they can be used with json_table
in place of a column type
specification. That is, the item methods can be used to specify the returned SQL
data type for the extracted JSON data.
You can also use such item
methods together with a json_value
RETURNING
clause or a json_table
column type
specification. What happens if the SQL data type to use for extracted JSON data is
controlled by both an item method and either a json_value
RETURNING
clause or a json_table
column
type?
-
If the two data types are compatible then the data type for the
RETURNING
clause or the column is used. For these purposes,VARCHAR2
is compatible with bothVARCHAR2
andCLOB
. -
If the data types are incompatible then a static, compile-time error is raised.
Table 15-1 Compatibility of Type-Conversion Item Methods and RETURNING Types
Item Method | Compatible RETURNING Clause Data Types |
---|---|
string() ,
stringOnly() , minString() , or
maxString() |
VARCHAR2 or CLOB ,
except that string() returns SQL
NULL for a JSON null
value
|
number() ,
numberOnly() , avg() ,
sum() , count() ,
minNumber() , or
maxNumber() |
NUMBER |
double() |
BINARY_DOUBLE |
float() |
BINARY_FLOAT |
date() |
DATE , with truncated time component
(set to zero), corresponding to RETURNING DATE TRUNCATE
TIME |
dateWithTime() |
DATE , with time component,
corresponding to RETURNING DATE PRESERVE
TIME |
timestamp() |
TIMESTAMP |
ymInterval() |
INTERVAL YEAR TO MONTH |
dsInterval() |
INTERVAL DAY TO SECOND |
boolean() or
booleanOnly() |
VARCHAR2 |
binary() |
RAW |
Using a RETURNING
clause or a column specification, you
can specify a length for character data and a precision and scale for numerical
data. This lets you assign a more precise SQL data type for extraction than what is
provided by an item method for target-data comparison purposes.
For example, if you use item method string()
and RETURNING
VARCHAR2(150)
then the data type of the returned data is
VARCHAR2(150)
, not
VARCHAR2(4000)
.
Example 15-1 Aggregating Values of a Field for Each Document
This example uses item method
avg()
to aggregate the values of field
Quantity
across all LineItems
elements of a
JSON document, returning the average for each document as a separate result
row.
SELECT json_value(po_document,
'$.LineItems[*].Quantity.avg()')
FROM j_purchaseorder;
Example 15-2 Aggregating Values of a Field Across All Documents
This example uses SQL function
avg
to aggregate the average line-item
Quantity
values for all JSON documents, returning the overall
average for the entire document collection as a single row. The average
quantity for all line items of a given document is computed using item method
avg()
.
SELECT avg(json_value(po_document,
'$.LineItems[*].Quantity.avg()'))
FROM j_purchaseorder;
Related Topics
- Basic SQL/JSON Path Expression Syntax
- Simple Dot-Notation Access to JSON Data
- ISO 8601 Date, Time, and Duration Support
- Types in Comparisons
- RETURNING Clause for SQL Query Functions
- SQL/JSON Function JSON_VALUE
- SQL/JSON Function JSON_TABLE
- Wrapper Clause for SQL/JSON Query Functions JSON_QUERY and JSON_TABLE
Parent topic: SQL/JSON Path Expressions
15.4 Types in Comparisons
Comparisons in SQL/JSON path-expression filter conditions are statically typed at compile time. If the effective types of the operands of a comparison are not known to be the same then an attempt is sometimes made to reconcile them by type-casting.
A SQL/JSON path expression targets JSON data, so the operands of a comparison are JSON values. Type comparison of JSON values is straightforward: JSON data types string, number, null, object, and array are mutually exclusive and incomparable.
But comparison operands are sometimes interpreted (essentially cast)
as values of SQL data types. This is the case, for example, when some item methods, such
as number()
, are used. This section addresses the type-checking of such
effective values.
You can prevent such type-casting by explicitly using one of the “only” item
methods. For example, applying method numberOnly()
prevents implicit
type-casting to a number.
SQL is a statically typed language; types are determined at compile time. The same applies to SQL/JSON path expressions, and in particular to comparisons in filter conditions. This means that you get the same result for a query regardless of how it is evaluated — whether functionally or using features such as indexes, materialized views, and In-Memory scans.
To realize this:
-
If the types of both operands are known and they are the same then type-checking is satisfied.
-
If the types of both operands are unknown then a compile-time error is raised.
-
If the type of one operand is known and the other is unknown then the latter operand is cast to the type of the former.
For example, in
$.a?(@.b.c == 3)
the type of$a.b.c
is unknown at compile time. The path expression is compiled as$.a?(@.b.c.number() == 3)
. At runtime an attempt is thus made to cast the data that matches$a.b.c
to a number. A string value"3"
would be cast to the number3
, satisfying the comparison.Foot 4 -
If the types of both operands are known and they are not the same then an attempt is made to cast the type of one to the type of the other. Details are presented below.
An attempt is made to reconcile comparison operands used in the following combinations, by type-casting. You can think of a type-casting item method being implicitly applied to one of the operands in order to make it type-compatible with the other operand.
-
Number compared with double —
double()
is implicitly applied to the number to make it a double value. -
Number compared with float —
float()
is implicitly applied to the number to make it a float value. -
String in a supported ISO 8601 format compared with date —
date()
is implicitly applied to the string to make it a date value. For this, the UTC time zone (Coordinated Universal Time, zero offset) is used as the default, taking into account any time zone specified in the string. -
String in a supported ISO 8601 format compared with timestamp without time zone —
timestamp()
is implicitly applied to the string to make it a timestamp value. For this, the UTC time zone (Coordinated Universal Time, zero offset) is used as the default, taking into account any time zone specified in the string.
Comparison operands used in the following combinations are not reconciled; a compile-time error is raised.
-
Number, double, or float compared with any type other than number, double, or float.
-
Boolean compared with any type other than Boolean.
-
Date or timestamp compared with string, unless the string has a supported ISO 8601 format.
-
Date compared with any non-date type other than string (in supported ISO 8601 format).
-
Timestamp (with or without time zone) compared with any non-timestamp type other than string (in supported ISO 8601 format).
- Timestamp compared with timestamp with time zone.
-
JSON null type compared with any type other than JSON null.
Parent topic: SQL/JSON Path Expressions
Footnote Legend
Footnote 1: Theto
in a range
specification is sometimes informally called the array slice
operator.Footnote 2:
!=
is an Oracle alias for the SQL/JSON standard comparison
predicate <>
.Footnote 3: Applying item method
timestamp()
to a supported ISO 8601 string
<ISO-STRING>
has the effect of SQL
sys_extract_utc(to_utc_timestamp_tz(<ISO-STRING>)
.Footnote 4: To prevent such casting here, you can explicitly apply item method
numberOnly()
: $.a?(@.b.c.numberOnly() ==
3)
. Data with a string value "3"
would simply
not match; it would be filtered out.