WARNING

This text was automatically converted from troff me macros to HTML. Information may have been lost, added, or changed in the process. Lars Aronsson and Lysator do not guarantee the correctness of this HTML document.

NAME

define operator *- define a new user operator

SYNOPSIS

define operator operator_name
 ( arg1 = type-1
  [ , arg2 = type-2 ]
  , procedure = func_name
  [, precedence = number ]
  [, associativity = (left | right | none | any) ]
  [, commutator = com_op ]
  [, negator = neg_op ]
  [, restrict = res_proc ]
  [, hashes]
  [, join = join_proc ]
  [, sort = sor_op1 {, sor_op2 } ]
 )

DESCRIPTION

This command defines a new user operator, "operator_name" . The user who defines an operator becomes its owner.

The name of the operator, operator_name , can be composed of symbols only. Also, the func_name procedure must have been previously defined using "define function" and must have one or two arguments. The types of the arguments for the operator and the type of the answer are as defined by the function. Precedence refers to the order that multiple instances of the same operator are evaluated. The next several fields are primarily for the use of the query optimizer.

The associativity value is used to indicate how an expression containing this operator should be evaluated when precedence and explicit grouping are insufficient to produce a complete order of evaluation. Left and right indicate that expressions containing the operator are to be evaluated from left to right or from right to left, respectively. None means that it is an error for this operator to be used without explicit grouping when there is ambiguity. And any , the default, indicates that the optimizer may choose to evaluate an expression which contains this operator arbitrarily.

The commutator operator is present so that POSTGRES can reverse the order of the operands if it wishes. For example, the operator area-less-than, >>>, would have a commutator operator, area-greater-than, <<<. Suppose that an operator, area-equal, ===, exists, as well as an area not equal, !==. Hence, the query optimizer could freely convert:

"0,0,1,1"::box >>> MYBOXES.description
to
MYBOXES.description <<< "0,0,1,1"::box
This allows the execution code to always use the latter representation and simplifies the query optimizer somewhat.

The negator operator allows the query optimizer to convert

not MYBOXES.description === "0,0,1,1"::box
to
MYBOXES.description !== "0,0,1,1"::box

If a commutator operator name is supplied, POSTGRES searches for it in the catalog. If it is found and it does not yet have a commutator itself, then the commutator's entry is updated to have the current (new) operator as its commutator. This applies to the negator, as well.

This is to allow the definition of two operators that are the commutators or the negators of each other. The first operator should be defined without a commutator or negator (as appropriate). When the second operator is defined, name the first as the commutator or negator. The first will be updated as a side effect.

The next two specifications are present to support the query optimizer in performing joins. POSTGRES can always evaluate a join (i.e., processing a clause with two tuple variables separated by an operator that returns a boolean) by iterative substitution [WONG76]. In addition, POSTGRES is planning on implementing a hash-join algorithm along the lines of [SHAP86]; however, it must know whether this strategy is applicable. For example, a hash-join algorithm is usable for a clause of the form:

MYBOXES.description === MYBOXES2.description
but not for a clause of the form:
MYBOXES.description <<< MYBOXES2.description.
The hashes flag gives the needed information to the query optimizer concerning whether a hash join strategy is usable for the operator in question.

Similarly, the two sort operators indicate to the query optimizer whether merge-sort is a usable join strategy and what operators should be used to sort the two operand classes. For the === clause above, the optimizer must sort both relations using the operator, <<<. On the other hand, merge-sort is not usable with the clause:

MYBOXES.description <<< MYBOXES2.description
If other join strategies are found to be practical, POSTGRES will change the optimizer and run-time system to use them and will require additional specification when an operator is defined. Fortunately, the research community invents new join strategies infrequently, and the added generality of user-defined join strategies was not felt to be worth the complexity involved.

The last two pieces of the specification are present so the query optimizer can estimate result sizes. If a clause of the form:

MYBOXES.description <<< "0,0,1,1"::box
is present in the qualification, then POSTGRES may have to estimate the fraction of the instances in MYBOXES that satisfy the clause. The function res_proc must be a registered function (meaning it is already defined using "define function" ) which accepts one argument of the correct data type and returns a floating point number. The query optimizer simply calls this function, passing the parameter
"0,0,1,1"
and multiplies the result by the relation size to get the desired expected number of instances.

Similarly, when the operands of the operator both contain instance variables, the query optimizer must estimate the size of the resulting join. The function join_proc will return another floating point number which will be multiplied by the cardinalities of the two classes involved to compute the desired expected result size.

The difference between the function

my_procedure_1 (MYBOXES.description, "0,0,1,1"::box)
and the operator
MYBOXES.description === "0,0,1,1"::box
is that POSTGRES attempts to optimize operators and can decide to use an index to restrict the search space when operators are involved. However, there is no attempt to optimize functions, and they are performed by brute force. Moreover, functions can have any number of arguments while operators are restricted to one or two.


EXAMPLE

/* The following command defines a new operator, */
/* area-equality, for the BOX data type.         */

define operator === (
 arg1 = box,
 arg2 = box,
 procedure = area_equal_procedure,
 precedence = 30,
 associativity = left,
 commutator = ===,
 negator = !==,
 restrict = area_restriction_procedure,
 hashes,
 join = area-join-procedure,
 sort = <<<, <<<)

SEE ALSO

remove operator(commands), define function(commands).

BUGS

Operator names cannot be composed of alphabetic characters in Version 4.0.

Operator precedence and associativity are not implemented in Version 4.0.