Synchronous generators are special versions of function definitions and method definitions that always return synchronous iterables:
// Generator function declaration
function* genFunc1() { /*···*/ }
// Generator function expression
const genFunc2 = function* () { /*···*/ };
// Generator method definition in an object literal
const obj = {
* generatorMethod() {
// ···
}
};
// Generator method definition in a class definition
// (class declaration or class expression)
class MyClass {
* generatorMethod() {
// ···
}
}
Asterisks (*
) mark functions and methods as generators:
function*
is a combination of the keyword function
and an asterisk.
*
is a modifier (similar to static
and get
).
yield
If we call a generator function, it returns an iterable (actually, an iterator that is also iterable). The generator fills that iterable via the yield
operator:
function* genFunc1() {
yield 'a';
yield 'b';
}
const iterable = genFunc1();
// Convert the iterable to an Array, to check what’s inside:
assert.deepEqual(
Array.from(iterable), ['a', 'b']
);
// We can also use a for-of loop
for (const x of genFunc1()) {
console.log(x);
}
Output:
a
b
yield
pauses a generator functionUsing a generator function involves the following steps:
iter
(that is also an iterable).
iter
repeatedly invokes iter.next()
. Each time, we jump into the body of the generator function until there is a yield
that returns a value.
Therefore, yield
does more than just add values to iterables – it also pauses and exits the generator function:
return
, a yield
exits the body of the function and returns a value (to/via .next()
).
return
, if we repeat the invocation (of .next()
), execution resumes directly after the yield
.
Let’s examine what that means via the following generator function.
let location = 0;
function* genFunc2() {
location = 1; yield 'a';
location = 2; yield 'b';
location = 3;
}
In order to use genFunc2()
, we must first create the iterator/iterable iter
. genFunc2()
is now paused “before” its body.
const iter = genFunc2();
// genFunc2() is now paused “before” its body:
assert.equal(location, 0);
iter
implements the iteration protocol. Therefore, we control the execution of genFunc2()
via iter.next()
. Calling that method resumes the paused genFunc2()
and executes it until there is a yield
. Then execution pauses and .next()
returns the operand of the yield
:
assert.deepEqual(
iter.next(), {value: 'a', done: false});
// genFunc2() is now paused directly after the first `yield`:
assert.equal(location, 1);
Note that the yielded value 'a'
is wrapped in an object, which is how iterators always deliver their values.
We call iter.next()
again and execution continues where we previously paused. Once we encounter the second yield
, genFunc2()
is paused and .next()
returns the yielded value 'b'
.
assert.deepEqual(
iter.next(), {value: 'b', done: false});
// genFunc2() is now paused directly after the second `yield`:
assert.equal(location, 2);
We call iter.next()
one more time and execution continues until it leaves the body of genFunc2()
:
assert.deepEqual(
iter.next(), {value: undefined, done: true});
// We have reached the end of genFunc2():
assert.equal(location, 3);
This time, property .done
of the result of .next()
is true
, which means that the iterator is finished.
yield
pause execution?What are the benefits of yield
pausing execution? Why doesn’t it simply work like the Array method .push()
and fill the iterable with values without pausing?
Due to pausing, generators provide many of the features of coroutines (think processes that are multitasked cooperatively). For example, when we ask for the next value of an iterable, that value is computed lazily (on demand). The following two generator functions demonstrate what that means.
/**
* Returns an iterable over lines
*/
function* genLines() {
yield 'A line';
yield 'Another line';
yield 'Last line';
}
/**
* Input: iterable over lines
* Output: iterable over numbered lines
*/
function* numberLines(lineIterable) {
let lineNumber = 1;
for (const line of lineIterable) { // input
yield lineNumber + ': ' + line; // output
lineNumber++;
}
}
Note that the yield
in numberLines()
appears inside a for-of
loop. yield
can be used inside loops, but not inside callbacks (more on that later).
Let’s combine both generators to produce the iterable numberedLines
:
const numberedLines = numberLines(genLines());
assert.deepEqual(
numberedLines.next(), {value: '1: A line', done: false});
assert.deepEqual(
numberedLines.next(), {value: '2: Another line', done: false});
The key benefit of using generators here is that everything works incrementally: via numberedLines.next()
, we ask numberLines()
for only a single numbered line. In turn, it asks genLines()
for only a single unnumbered line.
This incrementalism continues to work if, for example, genLines()
reads its lines from a large text file: If we ask numberLines()
for a numbered line, we get one as soon as genLines()
has read its first line from the text file.
Without generators, genLines()
would first read all lines and return them. Then numberLines()
would number all lines and return them. We therefore have to wait much longer until we get the first numbered line.
Exercise: Turning a normal function into a generator
exercises/sync-generators/fib_seq_test.mjs
The following function mapIter()
is similar to the Array method .map()
, but it returns an iterable, not an Array, and produces its results on demand.
function* mapIter(iterable, func) {
let index = 0;
for (const x of iterable) {
yield func(x, index);
index++;
}
}
const iterable = mapIter(['a', 'b'], x => x + x);
assert.deepEqual(
Array.from(iterable), ['aa', 'bb']
);
Exercise: Filtering iterables
exercises/sync-generators/filter_iter_gen_test.mjs
yield*
yield
only works directly inside generators – so far we haven’t seen a way of delegating yielding to another function or method.
Let’s first examine what does not work: in the following example, we’d like foo()
to call bar()
, so that the latter yields two values for the former. Alas, a naive approach fails:
function* bar() {
yield 'a';
yield 'b';
}
function* foo() {
// Nothing happens if we call `bar()`:
bar();
}
assert.deepEqual(
Array.from(foo()), []
);
Why doesn’t this work? The function call bar()
returns an iterable, which we ignore.
What we want is for foo()
to yield everything that is yielded by bar()
. That’s what the yield*
operator does:
function* bar() {
yield 'a';
yield 'b';
}
function* foo() {
yield* bar();
}
assert.deepEqual(
Array.from(foo()), ['a', 'b']
);
In other words, the previous foo()
is roughly equivalent to:
function* foo() {
for (const x of bar()) {
yield x;
}
}
Note that yield*
works with any iterable:
function* gen() {
yield* [1, 2];
}
assert.deepEqual(
Array.from(gen()), [1, 2]
);
yield*
lets us make recursive calls in generators, which is useful when iterating over recursive data structures such as trees. Take, for example, the following data structure for binary trees.
class BinaryTree {
constructor(value, left=null, right=null) {
this.value = value;
this.left = left;
this.right = right;
}
/** Prefix iteration: parent before children */
* [Symbol.iterator]() {
yield this.value;
if (this.left) {
// Same as yield* this.left[Symbol.iterator]()
yield* this.left;
}
if (this.right) {
yield* this.right;
}
}
}
Method [Symbol.iterator]()
adds support for the iteration protocol, which means that we can use a for-of
loop to iterate over an instance of BinaryTree
:
const tree = new BinaryTree('a',
new BinaryTree('b',
new BinaryTree('c'),
new BinaryTree('d')),
new BinaryTree('e'));
for (const x of tree) {
console.log(x);
}
Output:
a
b
c
d
e
Exercise: Iterating over a nested Array
exercises/sync-generators/iter_nested_arrays_test.mjs
In preparation for the next section, we need to learn about two different styles of iterating over the values “inside” an object:
External iteration (pull): Your code asks the object for the values via an iteration protocol. For example, the for-of
loop is based on JavaScript’s iteration protocol:
for (const x of ['a', 'b']) {
console.log(x);
}
Output:
a
b
Internal iteration (push): We pass a callback function to a method of the object and the method feeds the values to the callback. For example, Arrays have the method .forEach()
:
['a', 'b'].forEach((x) => {
console.log(x);
});
Output:
a
b
The next section has examples for both styles of iteration.
One important use case for generators is extracting and reusing traversals.
As an example, consider the following function that traverses a tree of files and logs their paths (it uses the Node.js API for doing so):
function logPaths(dir) {
for (const fileName of fs.readdirSync(dir)) {
const filePath = path.join(dir, fileName);
console.log(filePath);
const stats = fs.statSync(filePath);
if (stats.isDirectory()) {
logPaths(filePath); // recursive call
}
}
}
Consider the following directory:
mydir/
a.txt
b.txt
subdir/
c.txt
Let’s log the paths inside mydir/
:
logPaths('mydir');
Output:
mydir/a.txt
mydir/b.txt
mydir/subdir
mydir/subdir/c.txt
How can we reuse this traversal and do something other than logging the paths?
One way of reusing traversal code is via internal iteration: Each traversed value is passed to a callback (line A).
function visitPaths(dir, callback) {
for (const fileName of fs.readdirSync(dir)) {
const filePath = path.join(dir, fileName);
callback(filePath); // (A)
const stats = fs.statSync(filePath);
if (stats.isDirectory()) {
visitPaths(filePath, callback);
}
}
}
const paths = [];
visitPaths('mydir', p => paths.push(p));
assert.deepEqual(
paths,
[
'mydir/a.txt',
'mydir/b.txt',
'mydir/subdir',
'mydir/subdir/c.txt',
]);
Another way of reusing traversal code is via external iteration: We can write a generator that yields all traversed values (line A).
function* iterPaths(dir) {
for (const fileName of fs.readdirSync(dir)) {
const filePath = path.join(dir, fileName);
yield filePath; // (A)
const stats = fs.statSync(filePath);
if (stats.isDirectory()) {
yield* iterPaths(filePath);
}
}
}
const paths = Array.from(iterPaths('mydir'));
The chapter on generators in Exploring ES6 covers two features that are beyond the scope of this book:
yield
can also receive data, via an argument of .next()
.
return
values (not just yield
them). Such values do not become iteration values, but can be retrieved via yield*
.