Chapter 19. Putting it all together: notes on the structure of Relojban texts

19.1. Introductory

This chapter is incurably miscellaneous. It describes the cmavo that specify the structure of Relojban texts, from the largest scale (paragraphs) to the smallest (single words). There are fewer examples than are found in other chapters of this book, since the linguistic mechanisms described are generally made use of in conversation or else in long documents.

This chapter is also not very self-contained. It makes passing reference to a great many concepts which are explained in full only in other chapters. The alternative would be a chapter on text structure which was as complex as all the other chapters put together. Relojban is a unified language, and it is not possible to understand any part of it (in full) before understanding every part of it (to some degree).

19.2. Sentences: I

The following cmavo is discussed in this section:

.i

I

sentence separator

Since Relojban is audio-visually isomorphic, there needs to be a spoken and written way of signaling the end of a sentence and the start of the following one. In written English, a period serves this purpose; in spoken English, a tone contour (rising or falling) usually does the job, or sometimes a long pause. Relojban uses a single separator: the cmavo i (of selma'o I):

Example 19.1.

mi	klama	lo	zarci	.i	do	cadzu	lo	bisli
I	go-to	the	store.		You	walk-on	the	ice.

The word “separator” should be noted. i is not normally used after the last sentence nor before the first one, although both positions are technically grammatical. i signals a new sentence on the same topic, not necessarily by the same speaker. The relationship between the sentences is left vague, except in stories, where the relationship usually is temporal, and the following sentence states something that happened after the previous sentence.

Note that although the first letter of an English sentence is capitalized, the cmavo i is never capitalized. In writing, it is appropriate to place extra space before i to make it stand out better for the reader. In some styles of Relojban writing, every i is placed at the beginning of a line, possibly leaving space at the end of the previous line.

An i cmavo may or may not be used when the speaker of the following sentence is different from the speaker of the preceding sentence, depending on whether the sentences are felt to be connected or not.

An i cmavo can be followed by a logical or non-logical connective (a jek or joik), a modal or tense connective, or both: these constructs are explained in Section 9.8, Section 10.16, and Section 14.4. In all cases, the i comes first in the structure. Attitudinals can also be attached to an i if they are meant to apply to the whole sentence: see Section 13.9.

There exist a pair of mechanisms for binding a sequence of sentences closely together. If the i (with or without connectives) is followed by bo (of selma'o BO), then the two sentences being separated are understood to be more closely grouped than sentences connected by i alone.

Similarly, a group of sentences can be preceded by tu'e (of selma'o TUhE) and followed by tu'u (of selma'o TUhU) to fuse them into a single unit. A common use of tu'e…tu'u is to group the sentences which compose a poem: the title sentence would precede the group, separated from it by i. Another use might be a set of directions, where each numbered direction might be surrounded by tu'e…tu'u and contain one or more sentences separated by i. Grouping with tu'e and tu'u is analogous to grouping with ke and ke'e to establish the scope of logical or non-logical connectives (see Section 14.8).

19.3. Paragraphs: NIhO

The following cmavo are discussed in this section:

ni'o	NIhO	new topic
no'i	NIhO	old topic
da'o	DAhO	cancel cmavo assignments

The paragraph is a concept used in writing systems for two purposes: to indicate changes of topic, and to break up the hard-to-read appearance of large blocks of text on the page. The former function is represented in both spoken and written Relojban by the cmavo ni'o and no'i, both of selma'o NIhO. Of these two, ni'o is the more common. By convention, written Relojban is broken into paragraphs just before any ni'o or no'i, but a very long passage on a single topic might be paragraphed before an i. On the other hand, it is conventional in English to start a new paragraph in dialogue when a new speaker starts, but this convention is not commonly observed in Relojban dialogues. Of course, none of these conventions affect meaning in any way.

A ni'o can take the place of an i as a sentence separator, and in addition signals a new topic or paragraph. Grammatically, any number of ni'o cmavo can appear consecutively and are equivalent to a single one; semantically, a greater number of ni'o cmavo indicates a larger-scale change of topic. This feature allows complexly structured text, with topics, subtopics, and sub-subtopics, to be represented clearly and unambiguously in both spoken and written Relojban.

In spoken text, which is inherently less structured, a single ni'o might have the effect of multiple ones in a written work. Normally, a written text will begin with the number of ni'o cmavo needed to signal the largest scale division which the text contains. ni'o strings may be subscripted to label each context of discourse: see Section 19.6.

no'i is similar in effect to ni'o, but indicates the resumption of a previous topic. In speech, it is analogous to (but much shorter than) such English discursive phrases as “But getting back to the point ...”. By default, the topic resumed is that in effect before the last ni'o. When subtopics are nested within topics, then no'i would resume the previous subtopic and no'i no'i the previous topic.

If a ni'o is subscripted, then a no'i with the same subscript is assumed to be a continuation of it. A no'i may also have a negative subscript, which would specify counting backwards a number of paragraphs and resuming the topic found thereby.

19.4. Topic-comment sentences: ZOhU

The following cmavo is discussed in this section:

zo'u

ZOhU

topic/comment separator

The normal Relojban sentence is just a bridi, parallel to the normal English sentence which has a subject and a predicate:

Example 19.2.

mi

klama

lo

zarci

I went-to the market

In Chinese, the sentence form can be different: a topic is stated, and a comment about it is made. (Japanese also has the concept of a topic, but indicates it by attaching a suffix; other languages also distinguish topics in various ways.) The topic says what the sentence is about:

Example 19.3.

水果我最喜欢香蕉。
shuǐguǒ wǒ zuì xǐhuān xiāngjiāo
fruits: I mostly like bananas
As for fruits I like bananas most.

The colon in the first translation of Example 19.3 separates the topic (“fruits”) from the comment (“I mostly like bananas”).

Relojban uses the cmavo zo'u (of selma'o ZOhU) to separate topic (a sumti) from comment (a bridi):

Example 19.4.

lo grute	zo'u	mi	traji	nelci	lo badna
Fruits	:	I	most	like	bananas.

Example 19.4 is the literal Relojban translation of Example 19.3. Of course, the topic-comment structure can be changed to a straightforward bridi structure:

Example 19.5.

lo badna	cu	traji	lo ka	se nelci	mi	vau		lo grute
Bananas		are superlative in	(being	liked by	me	)	among	fruits.

Example 19.5 means the same as Example 19.4, and it is more straightforward in the structure. However, often the position of the topic in the place structure of the selbri within the comment is vague:

Example 19.6.

lo finpe	zo'u	citka
Fish	:	eat

Is the fish eating or being eaten? The sentence doesn't say. The Chinese equivalent of Example 19.6 is:

Example 19.7.

鱼吃
yú chī
fish: eat

which is vague in exactly the same way.

It is possible to have more than one sumti before zo'u. This is necessary in the other use of zo'u: to separate a quantifying section from a bridi containing quantified variables. This usage belongs to a discussion of quantifier logic in Relojban (see Section 16.2), but an example would be:

Example 19.8.

ro	da	poi	prenu	ku'o
For-all	X	which	are-persons,

su'o	de	zo'u	de	patfu	da
there-exists-a	Y	such-that	Y	is-the-father-of	X.

Every person has a father.

The string of sumti before zo'u (called the “prenex”: see Section 16.2) may contain both a topic and bound variables:

Example 19.9.

	lo	si'o	patfu	kei	ro	da	poi	prenu	ku'o
For	the	idea-of	fathers		for-all	X	which	are-persons,

su'o	de	zo'u	de	patfu	da
there-exists-a	Y	such-that	Y	is-the-father-of	X.

As for fathers, every person has one.

To specify a topic which affects more than one sentence, wrap the sentences in tu'e…tu'u brackets and place the topic and the zo'u directly in front.

Example 19.10.

lo jdini	zo'u	tu'e		ponse	.i na ja	djica	[tu'u]
Money	:	(	[if]	possess,	then	want	)

Money: if you have it, you want it.

Note: In Relojban, you do not “want money”; you “want to have money” or something of the sort, as the x₂ place of djica demands an event. As a result, the straightforward rendering of Example 19.9 without a topic is not:

Example 19.11.

do	ponse	lo jdini	.i na ja	do	djica	ri
You	possess	money	only-if	you	desire	its-mere-existence.

where ri means lo jdini and is interpreted as “the mere existence of money”, but rather:

Example 19.12.

do	ponse	lo jdini	.i na ja	do	djica	tu'a	ri
You	possess	money	only-if	you	desire	something-about	it.

namely, the possession of money. But topic-comment sentences like Example 19.10 are inherently vague, and this difference between ponse (which expects a physical object in x₂) and djica is ignored. See Example 19.46 for another topic/comment sentence.

The subject of an English sentence is often the topic as well, but in Relojban the sumti in the x₁ place is not necessarily the topic. Thus Relojban sentences don't necessarily have a “subject” in the English sense.

19.5. Questions and answers

The following cmavo are discussed in this section:

xu	UI	truth question
ma	KOhA	sumti question
mo	GOhA	bridi question
xo	PA	number question
ji	A	sumti connective question
ge'i	GA	forethought connective question
gi'i	GIhA	bridi-tail connective question
gu'i	GUhA	tanru forethought connective question
je'i	JA	tanru connective question
pei	UI	attitude question
fi'a	FA	place structure question
cu'e	CUhE	tense/modal question
pau	UI	question premarker

Relojban questions are not at all like English questions. There are two basic types: truth questions, of the form “Is it true that ...”, and fill-in-the-blank questions. Truth questions are marked by preceding the bridi, or following any part of it specifically questioned, with the cmavo xu (of selma'o UI):

Example 19.13.

xu	do	klama	lo	zarci
[True-or-false?]	You	go-to	the	store

Are you going to the store/Did you go to the store?

(Since the Relojban is tenseless, either colloquial translation might be correct.) Truth questions are further discussed in Section 15.8.

Fill-in-the-blank questions have a cmavo representing some Relojban word or phrase which is not known to the questioner, and which the answerer is to supply. There are a variety of cmavo belonging to different selma'o which provide different kinds of blanks.

Where a sumti is not known, a question may be formed with ma (of selma'o KOhA), which is a kind of sumka'i:

Example 19.14.

ma	klama	lo	zarci
[What-sumti?]	goes-to	the	store

Who is going to the store?

Of course, the ma need not be in the x₁ place:

Example 19.15.

do	klama	ma
You	go-to	[what-sumti?]

Where are you going?

The answer is a simple sumti:

Example 19.16.

lo zarci

The store.

A sumti, then, is a legal utterance, although it does not by itself constitute a bridi – it does not claim anything, but merely completes the open-ended claim of the previous bridi.

There can be two ma cmavo in a single question:

Example 19.17.

ma klama ma

Who goes where?

and the answer would be two sumti, which are meant to fill in the two ma cmavo in order:

Example 19.18.

mi	lo	zarci
I,	[to]-the	store.

An even more complex example, depending on the non-logical connective fa'u (of selma'o JOI), which is like the English “and ... respectively”:

Example 19.19.

ma

fa'u

ma

klama

ma

fa'u

ma

Who and who goes where and where, -respectively?

An answer might be

Example 19.20.

la	.djan.	la	.marcas.	le	zarci	le	briju
	John,		Marsha,	the	store,	the	office.

John and Marsha go to the store and the office, respectively.

Questions to be answered with a selbri are expressed with mo of selma'o GOhA, which is a kind of brika'i:

Example 19.21.

la	.relojban.	cu	mo
	Relojban		[what-selbri?]

What is Relojban? / What about Relojban? / What does Relojban do?

Here the answerer is to supply some predicate which is true of Relojban. Such questions are extremely open-ended, due to the enormous range of possible predicate answers. The answer might be just a selbri, or might be a full bridi, in which case the sumti in the answer override those provided by the questioner. To limit the range of a mo question, make it part of a tanru.

Questions about numbers are expressed with xo of selma'o PA:

Example 19.22.

do	viska	xo	prenu
You	saw	[what-number?]	persons.

How many people did you see?

The answer would be a simple number, another kind of non-bridi utterance:

Example 19.23.

vomu

Forty-five.

Fill-in-the-blank questions may also be asked about: logical connectives (using cmavo ji of A, ge'i of GA, gi'i of GIhA, gu'i of GUhA, or je'i of JA, and receiving an ek, gihek, ijek, or ijoik as an answer) – see Section 14.13; attitudes (using pei of UI, and receiving an attitudinal as an answer) – see Section 13.10; place structures (using fi'a of FA, and receiving a cmavo of FA as an answer) – see Section 9.3; tenses and modals (using cu'e of CUhE, and receiving any tense or BAI cmavo as an answer) – see Section 9.6 and Chapter 10.

Questions can be marked by placing pau (of selma'o UI) before the question bridi. See Section 13.13 for details.

The full list of non-bridi utterances suitable as answers to questions is:

any number of sumti (with elidable terminator vau, see Chapter 6)
any afterthought connective (see Chapter 14)
a number, or any mathematical expression placed in parentheses (see Chapter 18)
a bare na negator (to negate some previously expressed bridi), or corresponding ja'a affirmer (see Chapter 15)
a relative clause (to modify some previously expressed sumti, see Chapter 8)
a prenex/topic (to modify some previously expressed bridi, see Chapter 16)
linked arguments (beginning with be or bei and attached to some previously expressed selbri, often in a description, see Section 5.7)
indicators (to express a prevailing attitude, see Chapter 13)

Where not needed for the expression of answers, most of these are made grammatical for pragmatic reasons: people will say them in conversation, and there is no reason to rule them out as ungrammatical merely because most of them are vague.

19.6. Subscripts: XI

The following cmavo is discussed in this section:

xi

XI

subscript

The cmavo xi (of selma'o XI) indicates that a subscript (a number, a lerfu string, or a parenthesized mekso) follows. Subscripts can be attached to almost any construction and are placed following the construction (or its terminator word, which is generally required). They are useful either to extend the finite cmavo list to infinite length, or to make more refined distinctions than the standard cmavo list permits. The remainder of this section mentions some places where subscripts might naturally be used.

Relojban gismu have at most five places:

Example 19.24.

mi

cu

klama

lo

zarci

lo

zdani

lo

dargu

lo

karce

I

go

to

the

market

from

the

house

via

the

road

using

the

car.

Consequently, selma'o SE (which operates on a selbri to change the order of its places) and selma'o FA (which provides place number tags for individual sumti) have only enough members to handle up to five places. Conversion of Example 19.24, using xe to swap the x₁ and x₅ places, would produce:

Example 19.25.

lo	karce	cu	xe-klama		lo	zarci
The	car		is-a-transportation-means	to	the	market

	lo	zdani		lo	dargu		mi
from	the	house	via	the	road	for	me.

And reordering of the place structures might produce:

Example 19.26.

fo	lo	dargu	fi	lo	zdani	fa	mi
Via	the	road,	from	the	house,		I,

fe	lo	zarci	fu	lo	karce	cu	klama
to	the	market,	using	the	car,		go.

Example 19.24 to Example 19.26 all mean the same thing. But consider the lujvo nunkla, formed by applying the abstraction operator nu to klama:

Example 19.27.

la'e	di'u	cu	nunkla		mi
The-referent-of	the-previous-sentence		is-an-event-of-going	by	me

	lo	zarci		lo	zdani		lo	dargu		lo	karce
to	the	market	from	the	house	via	the	road	using	the	car.

Example 19.27 shows that nunkla has six places: the five places of klama plus a new one (placed first) for the event itself. Performing transformations similar to that of Example 19.25 requires an additional conversion cmavo that exchanges the x₁ and x₆ places. The solution is to use any cmavo of SE with a subscript "6" (Section 19.6):

Example 19.28.

lo	karce	cu	sexixa nunkla		mi
The	car		is-a-transportation-means-in-the-event-of-going	by	me

	lo	zarci		lo	zdani
to	the	market	from	the	house

	lo	dargu		la'e	di'u
via	the	road	the-event-being	that-which-is-referred-to-by	the-last-sentence.

Likewise, a sixth place tag can be created by using any cmavo of FA with a subscript:

Example 19.29.

fu	lo	dargu	fo	lo	zdani	fe	mi
Via	the	road,	from	the	house,	by	me,

fa	la'e	di'u
the-event-being	that-which-is-referred-to-by	the-last-sentence,

fi	lo	zarci	faxixa	lo	karce	cu	nunkla
to	the	market,	using	the	car,		is-an-event-of-going.

Example 19.27 to Example 19.29 also all mean the same thing, and each is derived straightforwardly from any of the others, despite the tortured nature of the English glosses. In addition, any other member of SE or FA could be substituted into sexixa and faxixa without change of meaning: vexixa means the same thing as sexixa.

Relojban provides two groups of sumka'i, both belonging to selma'o KOhA. The ko'a-series cmavo are used to refer to explicitly specified sumti to which they have been bound using goi. The da-series, on the other hand, are existentially or universally quantified variables. (These concepts are explained more fully in Chapter 16.) There are ten ko'a-series cmavo and 3 da-series cmavo available.

If more are required, any cmavo of the ko'a-series or the da-series can be subscripted:

Example 19.30.

da	xi	vo
X	sub	4

is the 4th bound variable of the 1st sequence of the da-series, and

Example 19.31.

ko'i	xi	paso
something-3	sub	18

is the 18th free variable of the 3rd sequence of the ko'a-series. This convention allows 10 sequences of ko'a-type sumka'i and 3 sequences of da-type sumka'i, each with as many members as needed. Note that daxivo and dexivo are considered to be distinct sumka'i, unlike the situation with sexixa and vexixa above. Exactly similar treatment can be given to the bu'a-series of selma'o GOhA and to the gismu brika'i broda, brode, brodi, brodo, and brodu.

Subscripts on lerfu words are used in the standard mathematical way to extend the number of variables:

Example 19.32.

li

xy.xipa

du

li

xy.xire

su'i

xy.xici

The-number

x-sub-1

equals

the-number

x-sub-2

plus

x-sub-3

x₁ = x₂ + x₃

and can be used to extend the number of sumka'i as well, since lerfu strings outside mathematical contexts are grammatically and semantically equivalent to sumka'i of the ko'a-series.

Names, which are similar to sumka'i, can also be subscripted to distinguish two individuals with the same name:

Example 19.33.

la .djan.	xipa	cu	cusku	lu	mi'e	nai	do	li'u		la .djan.	xire
John₁		says	[quote]	I-am	not	you	[unquote]	to	John₂	.

Subscripts on tenses allow talking about more than one time or place that is described by the same general cmavo. For example, puxipa could refer to one point in the past, and puxire a second point (earlier or later).

You can place a subscript on the word ja'a, the bridi affirmative of selma'o NA, to express so-called fuzzy truths. The usual machinery for fuzzy logic (statements whose truth value is not merely “true” or “false”, but is expressed by a number in the range 0 to 1) in Relojban is the abstractor jei:

Example 19.34.

li	pimu	jei	mi	ganra
The-number	.5	is-the-truth-value-of	my	being-broad.

However, by convention we can attach a subscript to ja'a to indicate fuzzy truth (or to na if we change the amount):

Example 19.35.

mi	ja'a	xi	pimu	ganra
I	truly	sub-	.5	am-broad

Finally, as mentioned in Section 19.2, ni'o and no'i cmavo with matching subscripts mark the start and the continuation of a given topic respectively. Different topics can be assigned to different subscripts.

Other uses of subscripts will doubtless be devised in future.

19.7. Utterance ordinals: MAI

The following cmavo are discussed in this section:

mai	MAI	utterance ordinal, -thly
mo'o	MAI	higher order utterance ordinal

Numerical free modifiers, corresponding to English “firstly”, “secondly”, and so on, can be created by suffixing mai or mo'o of selma'o MAI to a number or a lerfu string. Here are some examples:

Example 19.36.

mi

klama

lo

zarci

ku

pa

mai

.e

lo

zdani

ku

re

mai

I

go-to

the

store

1

-stly

and

the

house

2

-ndly.

This does not imply that I go to the store before I go to the house: that meaning requires a tense. The sumti are simply numbered for convenience of reference. Like other free modifiers, the utterance ordinals can be inserted almost anywhere in a sentence without affecting its grammar or its meaning.

Any of the Relojban numbers can be used with MAI: ro mai, for example, means “all-thly” or “lastly”. Likewise, if you are enumerating a long list and have forgotten which number is wanted next, you can say ny.mai, or “Nthly”.

The difference between mai and mo'o is that mo'o enumerates larger subdivisions of a text; mai was designed for lists of numbered items, whereas mo'o was intended to subdivide structured works. If this chapter were translated into Relojban, it might number each section with mo'o: this section would then be introduced with ze mo'o, or “Section 7.”

19.8. Attitude scope markers: FUhE/FUhO

The following cmavo are discussed in this section:

fu'e	FUhE	open attitudinal scope
fu'o	FUhO	close attitudinal scope

Relojban has a complex system of “attitudinals”, words which indicate the speaker's attitude to what is being said. The attitudinals include indicators of emotion, intensity markers, discursives (which show the structure of discourse), and evidentials (which indicate “how the speaker knows”). Most of these words belong to selma'o UI; the intensity markers belong to selma'o CAI for historical reasons, but the two selma'o are grammatically identical. The individual cmavo of UI and CAI are discussed in Chapter 13; only the rules for applying them in discourse are presented here.

Normally, an attitudinal applies to the preceding word only. However, if the preceding word is a structural cmavo which begins or ends a whole construction, then that whole construction is affected by the attitudinal:

Example 19.37.

mi	viska	lo	blanu	ia	zdani	[ku]
I	see	the	blue	[belief]	house.

I see the house, which I believe to be blue.

Example 19.38.

mi	viska	lo	blanu	zdani	ia	[ku]
I	see	the	blue	house	[belief].

I see the blue thing, which I believe to be a house.

Example 19.39.

mi	viska	lo	ke	ia	blanu	zdani	[ke'e]	[ku]
I	see	the	[belief]	blue	house

I see what I believe to be a blue house.

Example 19.40.

mi	viska	lo	ia	blanu	zdani	[ku]
I	see	the	[belief]	blue	house

It is a blue house that I believe I see.

Example 19.41.

mi	viska	lo	blanu	zdani	ku	ia
I	see	(the	blue	house)		[belief]

It is a blue house that I believe I see.

An attitudinal meant to cover a whole sentence can be attached to the preceding i, expressed or understood:

Example 19.42.

[.i]	ia	mi	viska	lo	blanu	zdani
	[belief]	I	see	the	blue	house.

I believe I see a blue house.

or to an explicit vau placed at the end of a bridi.

Likewise, an attitudinal meant to cover a whole paragraph can be attached to ni'o or no'i.

However, sometimes it is necessary to be more specific about the range of one or more attitudinals, particularly if the range crosses the boundaries of standard Relojban syntactic constructions. The cmavo fu'e (of selma'o FUhE) and fu'o (of selma'o FUhO) provide explicit scope markers. Placing fu'e in front of an attitudinal disconnects it from what precedes it, and instead says that it applies to all following words until further notice. The notice is given by fu'o, which can appear anywhere and cancels all attitudinals introduced by the last fu'e. For example:

Example 19.43.

mi	viska	lo	fu'e	ia	blanu	zdani	fu'o	ponse
I	see	the	[start]	[belief]	blue	house	[end]	possessor

I see the owner of what I believe to be a blue house.

Here, only the blanu zdani portion of the three-part tanru blanu zdani ponse is marked as a belief of the speaker. Naturally, the attitudinal scope markers do not affect the rules for interpreting multi-part tanru: blanu zdani groups first because tanru group from left to right unless overridden with ke or bo.

Other attitudinals of more local scope can appear after attitudinals marked by FUhE; these attitudinals are added to the globally active attitudinals rather than superseding them.

19.9. Quotations: LU, LIhU, LOhU, LEhU

The following cmavo are discussed in this section:

lu	LU	begin quotation
li'u	LIhU	end quotation
lo'u	LOhU	begin error quotation
le'u	LEhU	end error quotation

Grammatically, quotations are very simple in Relojban: all of them are sumti, and they all mean something like “the piece of text here quoted”:

Example 19.44.

mi	pu	cusku	lu	mi'e	.djan.	[li'u]
I	[past]	express	[quote]	I-am	John	[unquote]

I said, “I'm John”.

But in fact there are four different flavors of quotation in the language, involving six cmavo of six different selma'o. This being the case, quotation deserves some elaboration.

The simplest kind of quotation, exhibited in Example 19.44, uses the cmavo lu (of selma'o LU) as the opening quotation mark, and the cmavo li'u (of selma'o LIhU) as the closing quotation mark. The text between lu and li'u must be a valid, parseable Relojban text. If the quotation is ungrammatical, so is the surrounding expression. The cmavo li'u is technically an elidable terminator, but it's almost never possible to elide it except at the end of text.

The cmavo lo'u (of selma'o LOhU) and le'u (of selma'o LEhU) are used to surround a quotation that is not necessarily grammatical Relojban. However, the text must consist of morphologically correct Relojban words (as defined in Chapter 4), so that the le'u can be picked out reliably. The words need not be meaningful, but they must be recognizable as cmavo, brivla, or cmevla. Quotation with lo'u is essential to quoting ungrammatical Relojban for teaching in the language, the equivalent of the * that is used in English to mark such errors:

Example 19.45.

lo'u	mi du do du la .djan.	le'u
[quote]	mi du do du la .djan.	[unquote]

na	te gerna	la .relojban.
is-not	a-grammatical-structure-in	Relojban.

Example 19.45 is grammatical even though the embedded quotation is not. Similarly, lo'u quotation can quote fragments of a text which themselves do not constitute grammatical utterances:

Example 19.46.

lu	lo mlatu cu viska lo finpe	li'u	zo'u
[quote]	lo mlatu cu viska lo finpe	[unquote]	:

lo'u	viska lo	le'u	cu	selbasti
[quote]	viska lo	[unquote]		is-replaced-by

.ei	lo'u	viska pa	le'u
[obligation!]	[quote]	viska pa	[unquote].

In the sentence lo mlatu cu viska lo finpe, viska lo should be replaced by viska pa.

Note the topic-comment formulation (Section 19.4) and the indicator applying to the selbri only (Section 19.8). Neither viska lo nor viska pa is a valid Relojban utterance, and both require lo'u quotation.

Additionally, sumka'i or brika'i in the quoting sentence can refer to words appearing in the quoted sentence when lu…li'u is used, but not when lo'u ... le'u is used:

Example 19.47.

la	.tcarlis.	cu	cusku	lu	lo	ninmu	cu	morsi	li'u
	Charlie		says	[quote]	the	woman		is-dead	[unquote].

.i ku'i	ri	jmive
However,	the-last-mentioned	is-alive.

Charlie says “The woman is dead”, but she is alive.

In Example 19.47, ri is a sumka'i which refers to the most recent previous sumti, namely lo ninmu. Compare:

Example 19.48.

la	.tcarlis.	cu	cusku	lo'u	lo ninmu cu morsi	le'u
	Charlie		says	[quote]	lo ninmu cu morsi	[unquote].

.i ku'i	ri	jmive
However,	the-last-mentioned	is-alive.

Charlie says lo ninmu cu morsi, but that sentence is alive.

In Example 19.48, ri cannot refer to the referent of the alleged sumti lo ninmu, because lo ninmu cu morsi is a mere uninterpreted sequence of Relojban words. Instead, ri ends up referring to the quoted text, resulting in nonsense.

The metalinguistic erasers si, sa, and su, discussed in Section 19.13, do not operate in text between lo'u and le'u. A lo'u quotation may not begin with bu. Since the first le'u terminates a lo'u quotation, it is not directly possible to have a lo'u quotation within another lo'u quotation. Note that le'u is not an elidable terminator; it is required.

19.10. More on quotations: ZO, ZOI

The following cmavo are discussed in this section:

zo	ZO	quote single word
zoi	ZOI	non-Relojban quotation
la'o	ZOI	non-Relojban name

The cmavo zo (of selma'o ZO) is a strong quotation mark for the single following word, which can be any Relojban word whatsoever. Among other uses, zo allows a metalinguistic word to be referenced without having it act on the surrounding text. The word must be a morphologically legal (but not necessarily meaningful) single Relojban word; multiple cmavo, zei-lujvo, and bu-letterals cannot be quoted this way. For example:

Example 19.49.

zo si cu lojbo valsi

si is a Lojbanic word.

Since zo acts on a single word only, there is no corresponding terminator. Brevity, then, is a great advantage of zo, since the terminators for other kinds of quotation are rarely or never elidable.

The cmavo zoi (of selma'o ZOI) is a quotation mark for quoting non-Relojban text. Its syntax is zoi .X. text .X., where X is a Relojban word (called the delimiting word) which is separated from the quoted text by pauses, and which is not found in the written text or spoken phoneme stream. It is common, but not required, to use the lerfu word (of selma'o BY) which corresponds to the Relojban name of the language being quoted:

Example 19.50.

zoi gy. John is a man .gy. cu glico jufra

“John is a man” is an English sentence.

where gy stands for glico. Other popular choices of delimiting words are .kuuot., a Relojban name which sounds like the English word “quote”, and the word zoi itself. Another possibility is a Relojban word suggesting the topic of the quotation.

Within written text, the Relojban written word used as a delimiting word may not appear, whereas within spoken text, the sound of the delimiting word may not be uttered. This leads to occasional breakdowns of audio-visual isomorphism: Example 19.51 is fine in speech but ungrammatical as written, whereas Example 19.52 is correct when written but ungrammatical in speech.

Example 19.51.

mi djuno fi lo valsi po'u zoi fi. fight .fi.

I know about the word “fight”.

Example 19.52.

mi djuno fi lo valsi po'u zoi fai. fight .fai.

I know about the word “fight”.

The text fi appears in the written word “fight”, whereas the sound represented in Relojban by fai appears in the spoken word “fight”. Such borderline cases should be avoided as a matter of good style.

It should be noted particularly that zoi quotation is the only way to quote rafsi, specifically CCV rafsi, because they are not Relojban words, and zoi quotation is the only way to quote things which are not Relojban words. (CVC and CVV rafsi look like names and cmavo respectively, and so can be quoted using other methods.) For example:

Example 19.53.

zoi ry. sku .ry. cu rafsi zo cusku

“sku” is a rafsi of “cusku”.

(A minor note on interaction between lo'u ... le'u and zoi: The text between lo'u and le'u must consist of Relojban words only. zoi-quotes may only appear within lo'u-quotes if they contain morphologically valid Relojban containing no instances of le'u.)

Relojban strictly avoids any confusion between things and the names of things:

Example 19.54.

zo	.bab.	cmene	la	.bab.
The-word	“Bob”	is-the-name-of	the-one-named	Bob.

In Example 19.54, zo .bab. is the word, whereas la .bab. is the thing named by the word. The cmavo la'e and lu'e (of selma'o LAhE) convert back and forth between references and their referents:

Example 19.55.

zo	.bab.	cmene	la'e	zo	.bab.
The-word	“Bob”	is-the-name-of	the-referent-of	the-word	“Bob”	.

Example 19.56.

lu'e	la	.bab.	cu	cmene	la	.bab.
A-symbol-for		Bob		is-the-name-of		Bob.

Example 19.54 through Example 19.56 all mean approximately the same thing, except for differences in emphasis. Example 19.57 is different:

Example 19.57.

la .bab. cu cmene la .bab.

Bob is the name of Bob.

and says that Bob is both the name and the thing named, an unlikely situation. People are not names.

The cmavo la'o also belongs to selma'o ZOI, and is mentioned here for completeness, although it does not signal the beginning of a quotation. Instead, la'o serves to mark non-Relojban names, especially the Linnaean binomial names (such as “Homo sapiens”) which are the internationally standardized names for species of animals and plants. Internationally known names which can more easily be recognized by spelling rather than pronunciation, such as “Goethe”, can also appear in Relojban text with la'o:

Example 19.58.

la'o dy. Goethe .dy. cu me la'o ly. Homo sapiens .ly.

Goethe is a Homo sapiens.

Using la'o for all names rather than Lojbanizing, however, makes for very cumbersome text. A rough expansion of la'o might be lo se cmene be zoi.

19.11. Contrastive emphasis: BAhE

The following cmavo are discussed in this section:

ba'e	BAhE	emphasize next word
za'e	BAhE	next word is nonce

English often uses strong stress on a word to single it out for contrastive emphasis, thus

Example 19.59.

I saw George.

is quite different from

Example 19.60.

I saw George.

The heavy stress on “George” (represented in writing by italics) indicates that I saw George rather than someone else. Relojban does not use stress in this way: stress is used only to help separate words (because every brivla is stressed on the penultimate syllable) and in names to match other languages' stress patterns. Note that many other languages do not use stress in this way either; typically word order is rearranged, producing something like

Example 19.61.

It was George whom I saw.

In Relojban, the cmavo ba'e (of selma'o BAhE) precedes a single word which is to be emphasized:

Example 19.62.

mi	viska	lo	ba'e	laldo
I	saw	an	[emphasis]	old one.

I saw an old one.

ba'e before a cmavo that starts a construct serves to emphasize the whole construct:

Example 19.63.

ti	ba'e	ke	vofli	ctuca
This	[emphasis]	[start-grouping]	fly	teaches.

This one's a flying teacher.

Marking a word with a cmavo of BAhE does not change the word's grammar in any way. Any word in a bridi can receive contrastive emphasis marking:

Example 19.64.

ba'e mi viska la .djordj.

I, no one else, saw George.

Example 19.65.

mi ba'e viska la .djordj.

I saw (not heard or smelled) George.

Emphasis on one of the structural components of a Relojban bridi can also be achieved by rearranging it into an order that is not the speaker's or writer's usual order. Any sumti moved out of place, or the selbri when moved out of place, is emphatic to some degree.

For completeness, the cmavo za'e should be mentioned, also of selma'o BAhE. It marks a word as possibly irregular, non-standard, or nonce (created for the occasion):

Example 19.66.

mi	klama	la	za'e	.albeiniias.
I	go-to		so-called	Albania

marks a Lojbanization of an English name, where a more appropriate standard form might be something like la .ckipyris., reflecting the country's name in Albanian.

Before a lujvo or fu'ivla, za'e indicates that the word has been made up on the spot and may be used in a sense that is not found in the unabridged dictionary (when we have an unabridged dictionary!).

19.12. Parenthesis and metalinguistic commentary: TO, TOI, SEI

The following cmavo are discussed in this section:

to	TO	open parenthesis
to'i	TO	open editorial parenthesis
toi	TOI	close parenthesis
sei	SEI	metalinguistic bridi marker

The cmavo to and toi are discursive (non-mathematical) parentheses, for inserting parenthetical remarks. Any text whatsoever can go within the parentheses, and it is completely invisible to its context. It can, however, refer to the context by the use of sumka'i and brika'i: any that have been assigned in the context are still assigned in the parenthetical remarks, but the reverse is not true.

Example 19.67.

doi	.lisas.	mi	djica	lo	nu	to	doi	.frank.
O	Lisa,	I	desire	the	event-of	(	O	Frank,

ko	sisti	toi	do	viska	ti	noi	mlatu
[imperative]	stop!	)	you	see	this	which	is-a-cat.

Lisa, I want you to (Frank! Stop!) see this cat.

Example 19.67 implicitly redefines do within the parentheses: the listener is changed by doi .frank. When the context sentence resumes, however, the old listener, Lisa, is automatically restored.

There is another cmavo of selma'o TO: to'i. The difference between to and to'i is the difference between parentheses and square brackets in English prose. Remarks within to ... toi cmavo are implicitly by the same speaker, whereas remarks within to'i ... toi are implicitly by someone else, perhaps an editor:

Example 19.68.

la .frank. cu cusku lu mi prami do to'i sa'a zo do sinxa la .djein. toi li'u

Frank says “I love you [you = Jane]”

The sa'a suffix is a discursive cmavo (of selma'o UI) meaning “editorial insertion”, and indicating that the marked word or construct (in this case, the entire bracketed remark) is not part of the quotation. It is required whenever the to'i ... toi remark is physically within quotation marks, at least when speaking to literal-minded listeners; the convention may be relaxed if no actual confusion results.

Note: The parser believes that parentheses are attached to the previous word or construct, because it treats them as syntactic equivalents of subscripts and other such so-called “free modifiers”. Semantically, however, parenthetical remarks are not necessarily attached either to what precedes them or what follows them.

The cmavo sei (of selma'o SEI) begins an embedded discursive bridi. Comments added with sei are called “metalinguistic”, because they are comments about the discourse itself rather than about the subject matter of the discourse. This sense of the term “metalinguistic” is used throughout this chapter, and is not to be confused with the sense “language for expressing other languages”.

When marked with sei, a metalinguistic utterance can be embedded in another utterance as a discursive. In this way, discursives which do not have cmavo assigned in selma'o UI can be expressed:

Example 19.69.

la .frank. cu prami sei ri gleki la .djein.

Frank loves (he is happy) Jane.

Using the happiness attitudinal, ui, would imply that the speaker was happy. Instead, the speaker attributes happiness to Frank. It would probably be safe to elide the one who is happy, and say:

Example 19.70.

la .frank. cu prami sei gleki la .djein.

Frank loves (he is happy) Jane.

The grammar of the bridi following sei has an unusual limitation: the sumti must either precede the selbri, or must be glued into the selbri with be and bei:

Example 19.71.

la .frank. cu prami sei gleki be fa la .suzn. la .djein.

Frank loves (Susan is happy) Jane.

This restriction allows the terminator cmavo se'u to almost always be elided.

Since a discursive utterance is working at a “higher” level of abstraction than a non-discursive utterance, a non-discursive utterance cannot refer to a discursive utterance. Specifically, the various back-counting, reciprocal, and reflexive constructs in selma'o KOhA ignore the utterances at “higher” metalinguistic levels in determining their referent. It is possible, and sometimes necessary, to refer to lower metalinguistic levels. For example, the English “he said” in a conversation is metalinguistic. For this purpose, quotations are considered to be at a lower metalinguistic level than the surrounding context (a quoted text cannot refer to the statements of the one who quotes it), whereas parenthetical remarks are considered to be at a higher level than the context.

Relojban works differently from English in that the “he said” can be marked instead of the quotation. In Relojban, you can say:

Example 19.72.

la	.djan.	cu	cusku	lu	mi	klama	lo	zarci	li'u
	John		expresses	[quote]	I	go-to	the	store	[unquote].

which literally claims that John uttered the quoted text. If the central claim is that John made the utterance, as is likely in conversation, this style is the most sensible. However, in written text which quotes a conversation, you don't want the “he said” or “she said” to be considered part of the conversation. If unmarked, it could mess up the anaphora counting. Instead, you can use:

Example 19.73.

lu	mi	klama	le	zarci	sei sa'a
[quote]	I	go-to	the	store	(

la	.djan.	cu	cusku	be	dei	li'u
	John		expresses		this-sentence	)[unquote]

“I go to the store”, said John.

And of course other orders are possible:

Example 19.74.

lu sei sa'a la .djan. cu cusku be dei mi klama le zarci

John said, “I go to the store”.

Example 19.75.

lu mi klama sei sa'a la .djan. cu cusku le zarci

“I go”, John said, “to the store”.

Note the sa'a following each sei, marking the sei and its attached bridi as an editorial insert, not part of the quotation. In a more relaxed style, these sa'a cmavo would probably be dropped.

The elidable terminator for sei is se'u (of selma'o SEhU); it is rarely needed, except to separate a selbri within the sei comment from an immediately following selbri (or component) outside the comment.

19.13. Erasure: SI, SA, SU

The following cmavo are discussed in this section:

si	SI	erase word
sa	SA	erase phrase
su	SU	erase discourse

The cmavo si (of selma'o SI) is a metalinguistic operator that erases the preceding word, as if it had never been spoken:

Example 19.76.

ti	gerku	si	mlatu
This	is-a-dog,	er,	is-a-cat.

means the same thing as ti mlatu. Multiple si cmavo in succession erase the appropriate number of words:

Example 19.77.

ta	blanu	zdani	si	si	xekri	zdani
That	is-a-blue	house,	er,	er,	is-a-black	house.

In order to erase the word zo, it is necessary to use two si cmavo in a row:

Example 19.78.

zo	.bab.	cmene	zo	si	si	la	.bab.
The-word	“Bob”	is-the-name-of	the	word	si,	er,	Bob.

The first use of si does not erase anything, but completes the zo quotation. Then, a single si is sufficient to erase the quotation (cmavo after a zo quote treat it as if it were a single word).

Incorrect names can likewise cause trouble with si:

Example 19.79.

mi	tavla	fo	la	.esperanto
I	talk	in-language	that-named	and-speranto,

si	si	.esperanton.
er,	er,	Esperanto.

The Lojbanized spelling .esperanto breaks up, as a consequence of the Relojban morphology rules (see Chapter 4) into two Relojban words, the cmavo e and the undefined lujvo speranto. Therefore, two si cmavo are needed to erase them. Of course, .e speranto is not grammatical after la, but recognition of si is done before grammatical analysis.

Similarly messy is the result of an incorrect zoi:

Example 19.80.

mi	cusku	zoi	fy.	fy.	si	zo .djan.
I	say	[foreign]	[quote]	[unquote],	er,	“John”	.

In Example 19.80, the first fy is taken to be the delimiting word. The delimiting word must first be repeated to end the quotation. For purposes of si erasure, the entire ZOI structure is taken to be a word, so a single si cmavo is needed to erase it. Similarly, a stray lo'u quotation mark must be erased with le'u si, by completing the quotation and then erasing it with a si cmavo.

As the above examples plainly show, precise erasures with si can be hard to get right. Therefore, the cmavo sa (of selma'o SA) is provided for erasing more than one word. The cmavo following sa should be the starting marker of some grammatical construct. The effect of the sa is to erase back to and including the last starting marker of the same kind. For example:

Example 19.81.

mi	viska	la	sa	.i	mi	cusku	zo	.djan.
I	see	that-named	...		I	say	the-word	“John”	.

Since the word following sa is i, the sentence separator, its effect is to erase the preceding sentence. So Example 19.81 is equivalent to:

Example 19.82.

mi cusku zo .djan.

Another example, erasing a partial description rather than a partial sentence:

Example 19.83.

mi	viska	lo	blanu	.zdan.	sa	lo	xekri	zdani
I	see	the	blue	hou	...	the	black	house.

In Example 19.83, lo blanu .zdan. is ungrammatical, but clearly reflects the speaker's original intention to say lo blanu zdani. However, the zdani was cut off before the end and changed into a name. The entire ungrammatical le construct is erased and replaced by lo xekri zdani.

The cmavo su (of selma'o SU) is yet another metalinguistic operator that erases the entire text, up to the beginning of the last paragraph, lu quote, TO parenthesis, or tu'e grouping.

19.14. Hesitation: Y

The following cmavo is discussed in this section:

.y.

Y

hesitation noise

Speakers often need to hesitate to think of what to say next or for some extra-linguistic reason. There are two ways to hesitate in Relojban: to pause between words (that is, to say nothing) or to use the cmavo .y (of selma'o Y). This resembles in sound the English hesitation noise written “uh” (or “er”), but differs from it in the requirement for a pause before. Unlike a long pause, it cannot be mistaken for having nothing more to say: it holds the floor for the speaker. Since vowel length is not significant in Relojban, the y sound can be dragged out for as long as necessary or repeated.

Since the hesitation sound in English is outside the formal language, English-speakers may question the need for a formal cmavo. Speakers of other languages, however, often hesitate by saying (or, if necessary, repeating) a word (“este” in some dialects of Spanish, roughly meaning “that is”), and Relojban's audio-visual isomorphism requires a written representation of all meaningful spoken behavior. Of course, .y has no grammatical significance: it can appear anywhere at all in a Relojban sentence except in the middle of a word or before bu. .y cannot be quoted using zo, appear as a ZOI delimiter, or be part of a zei-lujvo.

19.15. No more to say: FAhO

The following cmavo is discussed in this section:

fa'o

FAhO

end of text

The cmavo fa'o (of selma'o FAhO) is the usually omitted marker for the end of a text; it can be used in computer interaction to indicate the end of input or output, or for explicitly giving up the floor during a discussion. It is outside the regular grammar, and the machine parser takes it as an unconditional signal to stop parsing unless it is quoted with zo or with lo'u ... le'u. In particular, it is not used at the end of subordinate texts quoted with lu…li'u or parenthesized with to ... toi.

19.16. List of cmavo interactions

The following list gives the cmavo and selma'o that are recognized by the earliest stages of the parser, and specifies exactly which of them interact with which others. All of the cmavo are at least mentioned in this chapter. The cmavo are written in lower case, and the selma'o in UPPER CASE.

zo quotes the following word, no matter what it is.
zei combines the preceding and the following word into a lujvo. Its left side acts like bu, its right side like zo.
si erases the preceding word unless it is a zo, ZOI, or zei.
sa erases the preceding word and other words, unless the preceding word is a zo, ZOI, or zei.
su erases even more words, unless it is preceded by a zo, ZOI, or zei, or precedes a bu or zei
lo'u quotes all following words up to a le'u.
le'u is ungrammatical except at the end of a lo'u quotation.
ZOI cmavo use the following word as a delimiting word, no matter what it is.
BAhE cmavo mark the following word, unless it is si, sa, or su, or unless it is preceded by zo, ZOI, or zei. Multiple BAhE cmavo may be used in succession.
bu makes the preceding word into a lerfu word, except for zo, ZOI cmavo, si, sa, fa'o, zei and bu. Multiple bu cmavo may be used in succession.
UI and CAI cmavo mark the previous word, except for zo, si, sa, su, lo'u, ZOI, fa'o, zei, BAhE cmavo, and bu. Multiple UI cmavo may be used in succession. A following nai is made part of the UI.
da'o, fu'e, and fu'o are the same as UI.

19.17. List of elidable terminators

The following list shows all the elidable terminators of Relojban. The first column is the terminator, the second column is the selma'o that starts the corresponding construction, and the third column states what kinds of grammatical constructs are terminated. Each terminator is the only cmavo of its selma'o, which naturally has the same name as the cmavo.

be'o	BE	sumti attached to a tanru unit
boi	PA/BY	number or lerfu string
do'u	COI/DOI	vocative phrases
fe'u	FIhO	ad-hoc modal tags
ge'u	GOI	relative phrases
kei	NU	abstraction bridi
ke'e	KE	groups of various kinds
ku	LE/LA	description sumti
ku'e	PEhO	forethought mekso
ku'o	NOI	relative clauses
li'u	LU	quotations
lo'o	LI	number sumti
lu'u	LAhE/NAhE+BO	sumti qualifiers
me'u	ME	tanru units formed from sumti
nu'u	NUhI	forethought termsets
se'u	SEI/SOI	metalinguistic insertions
te'u	various	mekso conversion constructs
toi	TO	parenthetical remarks
tu'u	TUhE	multiple sentences or paragraphs
vau	(none)	simple bridi or bridi-tails
ve'o	VEI	mekso parentheses