# **The Electronic Watch and Low-Power Circuits**

Eric A. Vittoz, IEEE Life Fellow member, evittoz@ieee.org

### **A New Research Laboratory**

This story starts in 1962 with the foundation of CEH (Centre Electronique Horloger or "Watchmakers Electronic Center"). The bipolar junction transistor (BJT) was 14 years old and had already replaced the vacuum tube in most electronic systems. A few years had passed since the invention of the integrated circuit, followed by that of the planar process by Swiss born physicist Jean Hoerni.

The Swiss watch industry was flourishing, exporting about 45 million watches per year for a total value of 1.4 billion Swiss francs. But it was undergoing a deep structural mutation from its traditional organisation based on corporatism. The whole watch industry was of a fully mechanical nature and most of the technical managers had been educated as traditional watchmakers. However, for several years, a very few visionary managers had been watching the recent development taking place on the other side of the Atlantic Ocean, in this strange new domain for them called solid-state electronics.

Led by Gerard Bauer, the dynamic director of the Federation of the Swiss Watch Industry (FH), a research group representing the major Swiss manufacturers was created, to evaluate if and how this new electronics could once be applicable in some way to the wristwatch itself. Through the long existing LSRH ("Swiss Laboratory for Watch Research", founded in the 40's for research in all aspects of the mechanical watch), they entrusted the Swiss Technical Universities with some limited contracts to evaluate these possibilities. They received rather negative conclusions, but however decided that a new joint center would be created to search for possible new watch systems based on electronics. After receiving enthusiastic support from Sydney de Coulon, the general Director of Ebauches SA, the powerful producer of all Swiss watch components, they hired Roger Wellinger as technical Director to lead the project. A Swiss electrical engineer, Roger had spent 15 years in the USA, first as an Associate Professor of EE at Illinois State University in Urbana, then with General Electric in Schenectady.

In 1962 CEH, the new joint research laboratory, was founded in Neuchâtel by several organisations representing most of the Swiss watch industry, with a tentative global budget of 5 million Swiss Francs, to be released year after year. The general mission of CEH was to develop new time-keeping devices. This goal was soon refined into that of developing an electronic wristwatch, with at least one advantage with respect to existing watches. No need to say that, at that time, few people could imagine that one of the main advantages would be a much lower

production cost for a better precision.

There was no real market pull: most people were happy with the precision of their automatic mechanical watch. Some electrical watches had recently appeared on the market, replacing the mechanical drive by an electromagnetic actuator driven from a small mercury battery through a mechanical switch. A French patent was proposing to replace the switch by a transistor (which was realized a few years later). However, this modest change did not bring much advantage, and those approaches were soon abandoned.

The only really new wristwatch was the Accutron that had been developed for Bulova by Swiss engineer Max Hetzel. Based on a metallic tuning fork oscillating at 360Hz and activated by a single transistor, this revolutionary watch had been commercialized in 1960. It was, however, still essentially mechanical, with a complex wheel-ratchet system driving the set of gears, and at a frequency too low to avoid being affected by gravity.

The electronic quartz clock had already been invented by Warren Marrisson in 1928, and transistorized quartz marine chronometers were just available. Their volume was 1.5dm<sup>3</sup>, and their power consumption larger than 100mW. Hence a quartz wristwatch did not have to be invented: it already existed as a system. The "only" challenge was to reduce its volume to less than 3cm<sup>3</sup>, and to lower its power consumption by 4 orders of magnitude, down to less than  $10\mu$ W. This was the maximum acceptable power consumption to ensure more than one year of life for the 1.35V mercury button cell compatible with the wristwatch volume. This new lab was initially lodged just under the roof of the LSRH building in Neuchâtel (Fig. 1). In May 1962, I was the very first electrical engineer hired at CEH by Roger Wellinger.



Fig. 1: LSRH building in Neuchâtel, Switzerland. During its first year, CEH was lodged just under the roof of the main building at the left. It then moved to this wing, where the process line was installed.

### **A Passion for Electricity**

I was born in 1938 in Lausanne. As far as I can remember, I have always been fascinated by electricity. My grandfather was an electrical engineer working for a local power company. His cellar was a cave of treasures, full of various discarded electricity counters. Back home, I used to take them apart, collecting the various parts. Most interesting was the sub-millimeter isolated copper wire of the coils. When I was about six years old, my first applications for it were 220V in-house electrical networks. To connect the meccano-made switches and light bulbs, I just had to scratch the isolation at the wire end with my teeth, bend it, and put it into the power plug. I learned the hard way the meaning of closing an electrical loop when I put my hands both sides of the switch! Nobody seemed to be really worried by the number of shocks I received during this early learning period. However, my father (also an engineer, but in civil engineering) finally decided that I should go to low voltage and he mounted an old radio transformer in a wooden box with just the 6.3V for filament heating available from outside.

My education was in classical literature, with the threat of "descending" to scientific if my results were not good enough (so was the hierarchy of matters at that time!). But I lost no opportunity to communicate my passion to my classmates. When their presentations to the class were about poets and musicians, mine were about electronic recording and electrical transformers (with a demo based on a self-made highvoltage transformer driven by a 4.5V battery chopped by an electric doorbell!). Applications of Ohm's law were the subject of a summer technical contest for which I built a series of demonstrators.

My interest soon concentrated on electronics, which at that time was almost synonymous to radio. At 14, I had built from scratch my first short-wave receiver, winding the thousands of turns of the 2x350V high voltage and manually sawing and bending the aluminium plates to mount the components. I passed my ham radio license at 18 and came "on the air" with a self-made transmitter, with an output power of 10W delivered by a Telefunken RL12P35 power tube (Fig. 2 shows a later version of my station). This was my first contact with the USA and an excellent opportunity to start practicing my school English. I still feel the excitement of waiting for an "opening" of the 10 meters band to contact American or Australian stations as if they were next door! With the Internet, the excitement of distant communication is lost nowadays, but at that time it was achieved with nothing more than my homemade rig.

I received my technical education from the "Ecole Polytechnique de l'Université de Lausanne" (EPUL, that later became EPFL, the Swiss Federal Institute of



Fig. 2: At my homemade ham radio station HB9VZ in 1959.

Technology), where the only professor of electronics was Roger Dessoulavy. I watched him as a god until I could finally attend his courses, during the 4th and last year. Most of the course was based on the vacuum tube, with just two sessions devoted to the transistor. Upon my request, my diploma project was part of a transistorized A/D converter. I then spent one year as a research assistant, working in the ancient gardener's house of EPUL and helping to speed up this converter to audio frequencies. That's where Wellinger found me and asked me to join his new team. We moved to Neuchâtel with my young wife Monique. She was going to provide me with an essential support all along my career.

### **Initial Research Period**

My very first task was to demonstrate that a quartz clock could be operated with the power consumption compatible with a wristwatch. I delivered a demonstrator (Fig. 3) that combined an oscillator circuit with a 10kHz commercial quartz, followed by 4 divide-byten stages, in order to reduce the frequency down to 1Hz. These dividers were synchronized astable multivibrators using 2N1711 planar bipolar transistors produced by SGS that exhibited a large current gain much below the microampere level. All of the electronics was consuming less than 10µA at 1.5V. But the display was a commercially available electric clock using a stepping motor and consuming more than 1mW. With this simple demonstrator on the table, Roger Wellinger had his 1.8 million franc budget for 1963 accepted by the board of directors. The real work could start, but experienced engineers were needed.

Roger was a very enthusiastic and visionary manager. Using all possible means, he was able to con-



# 10 kHz quartz oscillator 4 divide-by-ten stages

### Fig. 3: Microwatt clock demonstrator.

vince a bunch of talented Swiss engineers emigrated in the USA to abandon their interesting and promising jobs to come back to Switzerland and bring their experience to CEH. Among them, Max Forrer left General Electric in Palo Alto to lead the circuit and system section (he became my direct boss for the next 27 years!), and Kurt Hübner came to lead the microelectronic section with his 5-year experience at the Shockley Semiconductor Lab. The real work then started on four parallel paths.

Most essential was the building-up of an in-house IC facility, and the development of a process adapted to low voltage and very low power. Although the first functional MOS transistors had already been published, it was decided to start concentrating on the more traditional bipolar process. A fully operational facility, including a mask shop, was installed in the west wing of the LSRH building (Fig. 1).

The second activity was the search for an adequate time base. The goal was to obtain a precision better than 10ppm (corresponding roughly to 1 second per day) at a frequency lower than 10kHz, to limit the power needed for frequency division. The size of existing quartz resonators at this frequency was much too large, and experts in the field pretended that there was no way to reduce it without drastically reducing the quality factor. Hence we first searched for various types of metallic resonators combined with piezoelectric or electromagnetic transducers. We even evaluated a statistical time base using alpha particles emitted by a radioactive source.

But none of these approaches ended successfully. It is the merit of Armin Frei, another Swiss engineer repatriated from the USA, to have attacked and solved the problem of reducing the size of a sub-10kHz



Fig. 4: First prototype of 8192Hz small size quartz resonator developed at CEH.

quartz resonator. Against all odds and predictions, he was finally able to mount a miniaturized 8192Hz quartz oscillating in flexural mode in the vacuum of a small metallic package (Fig. 4).

The third activity was a search for an adequate display system. The LCD did not exist yet and the LED had just been demonstrated. It was decided to rely on electromechanical solutions, by taking advantage of the matchmakers' skills in micromechanics. Various schemes were evaluated, using electromagnetic and piezoelectric actuators. Two electromagnetic solutions were finally retained for the first prototypes.

The fourth activity was in electronics, essentially evaluating different frequency division techniques. I was mostly involved in this development; hence, I can describe it with more details.

The planar technology was very novel, only able to combine just a few components per chip. The rate at which this number could be increased was uncertain. No low-power process was available yet, and we did not discard the possibility to build the first watch by using discrete components. Therefore, our first aim was to minimize the number of components, and we started by looking into all kinds of analog division techniques.

One possibility was to use synchronized relaxation oscillators (Fig 5a). I tried various schemes, including one based on the tunnel diode (that was considered a very promising device at that time). Another scheme consisted in the step-by-step accumulation of some quantity until a threshold would be reached (Fig. 5b). I first tried to accumulate magnetic flux in a tiny toroidal magnet (of the kind used for computer memories at that time), but the energy per cycle was too high. A better solution was to accumulate a charge in a capacitor, in a kind of switched capacitor integrator, combined with a discharge circuit made of complementary BJT's. A hybrid version of a single divide-byeight stage mounted in a TO5 package was presented at the 1964 International Congress of Chronometry



Fig. 5: Analog frequency division techniques; (a) Relaxation oscillator synchronized by pulses; (b) Step by step accumulation.

(where CEH was asked to be present, although we were normally supposed to keep our results secret!).

It is during this period that I made my first visit to the USA, to attend the ISSCC'64 that was still held in Philadelphia. During an evening panel on "Minimum power solid-state devices and circuits", I was asked by a friend of Wellinger to say a few words about our work for an electronic watch. As I can remember, this idea was considered foolish, if not impossible! This trip was also a fantastic opportunity to visit a number of companies and universities and to meet personalities whose famous names were associated with the incipient semiconductor technology.

Analog solutions were indeed limiting the number of components, but the maximum possible dividing ratio was limited by the precision of these components. No more than a factor ten per stage could be expected, even less with the intended integrated versions. So I looked for an analog solution that could provide a very large divider ratio even with low-precision components. This was based on a phaselock loop as illustrated by Fig. 6.

The loop was made of a low frequency VCO (astable multivibrator) followed by a pulse shaper, the output p of which sampled the high frequency input. The result



was held by a second order filter that produced the voltage  $v_{\rm c}$  controlling the VCO frequency. The phase of the VCO could be locked to that of the input signal for any frequency ratio  $N = f_{\rm in} / f_{\rm out}$  that was keeping  $f_{\rm out}$  inside the hold range of the loop. The divider ratio was controlled by the initial condition imposed to the loop (and stored in its filter). The hold range could be made large enough to compensate for even a large error in the natural frequency of the VCO. The maximum possible ratio N was limited by the jitter of the VCO. Indeed, this jitter was producing a phase noise proportional to N, thus a probability of unlocking increasing with N.

A breadboard implementation of this divider (which became the subject of my Ph.D. thesis) is shown in Fig. 7. Without the oscillator, this divider contained only 3 bipolar transistors, 4 discrete MOS transistors fabricated at CEH and a few resistors and capacitors. A frequency ratio as high as 1000 could be obtained by means of just a few low-precision components. Consuming a few microamperes at 1.35V, it stayed in lock during the year needed to write my dissertation. This project was a lot of fun. It was an opportunity to learn about nonlinear and discrete-time systems, and it made me love the subject of PLL. But the reliability was definitely not sufficient (risk of loosing the ratio) to consider a mass production of watches based on this approach. Fortunately, in the meantime, our in-house process developments were sufficiently advanced to consider using a cascade of integrated binary dividers instead.



Fig. 7: Experimental phaselock loop frequency divider. It includes a 8kHz oscillator based on an early quartz prototype.

The first binary divider that we developed in 1966 (Fig. 8) was a modified version of a circuit found in the literature, itself a transistor implementation of the T-Flip-Flop used with vacuum tubes. The idea of current mirrors had not been proposed yet, and neither was the implementation of lateral pnp's (although both ideas seem so obvious *a posteriori*, like so many great ideas!). Hence, high-value resistors were the only available way to reduce the power. They were implemented by using the base layer, pinched between emitter E and collector C. Their maximum frequency of operation was thus limited by a large distributed capacitor and could be maximized by letting the E and C layers unconnected (floating). Implemented with our 10

micron bipolar process, this divider could operate up to 5kHz for a current drain of  $1\mu A.$ 

Circuit simulators did not exist yet, so circuits had to be verified on a breadboard before integration. All parasitic capacitors were extracted manually from the layout,



Fig. 8: First binary frequency divider.

but these were of course smaller than those associated with the breadboard itself. To circumvent this problem, the value of all extracted capacitors was multiplied by a large factor (typically 1000), whereas the frequency was reduced by the same factor. The dynamic behaviour of the integrated version could then be predicted with good precision. We used this simulation technique during many subsequent years, until circuit simulators and transistor models at very low current were finally available.

It is this integrated frequency divider that was used in the first prototypes of the electronic watch.

### The First Quartz Wristwatch<sup>1</sup>

The efforts of the various groups at CEH came to convergence with the realization of two different prototypes of the quartz wristwatch (Fig. 9). Both used the 8192Hz quartz cantilever developed by Armin Frei, driven by a symmetrical negative resistance circuit. Prototype Beta 1 needed 13 stages of binary division followed by some logic to drive the moving coil of a 1Hz stepping motor. In order to reduce the power consumption, prototype Beta 2 had only 5 stages of division. The 256Hz output was driving an electromagnetic resonant motor, with a ratchet driving the train of wheels. In 1967, ten of these prototypes were presented at the Neuchâtel Chronometry Observatory where they pulverized all previous results. They were then disclosed to our unsuspecting shareholders in a memorable technical seminar.

The industrial production was organized by CEH in collaboration with several interested watch companies. It was decided to start from the Beta 2 prototype



Fig. 9: First prototypes of quartz wristwatch developed by CEH.



Fig. 10: First LSI circuit developed for a wristwatch.

to ensure a battery life longer than one year. All the circuitry was integrated on a single chip, under the responsibility of Raymond Guye. This very first large scale integration circuit for a watch (Fig. 10) combined 110 components (NPN transistors, high-value diffused resistors and junction capacitors) in an area of 8.7mm<sup>2</sup>, and was consuming 12µA at 1.3V. Several other modifications of the watch prototype were introduced to realize the industrial version (Fig. 11) that was named Beta 21. An electronic module was supporting the quartz (not visible in the picture), the plastic encapsulated chip, and an adjustable trimmer capacitor for fine frequency adjustment. The latter became the source of many problems and would have to be eliminated in the future. The components were fabricated by several Swiss companies, but the circuit was produced by CEH in the IC facilities of its microelectronic division. A total of about 6000 units were delivered, and sold under different Swiss trade names.



Fig. 11: Beta 21, module of the first commercial watch developed at CEH.

I personally made very few direct contributions to the development of this first industrial product. Indeed, since 1967, and in parallel with the drafting of my Ph.D. dissertation, I was involved in the very early development of low-voltage CMOS circuits.

### **Pioneering Low-Voltage CMOS Circuits**

While most CEH collaborators were busy working on the first watch prototypes, Fritz Leuenberger was

developing a low-voltage CMOS process. Fritz had joined CEH after working in the semiconductor department of General Electric in Syracuse. Following the key paper of Wanlass and Sah at ISSCC'63<sup>2</sup>, and strongly supported by Max Forrer, he started developing low-threshold P-channel devices (a few of them were used in my experimental PLL frequency divider) before working on a low-voltage CMOS process. In 1968, he had assembled a perfectly workable process<sup>3</sup>. The n-well was obtained by chemically etching the p-type silicon substrate and epitaxially refilling it with n-type material. The surface was then smoothed by mechanical polishing. A molybdenumgold sandwich was used for the gates (and interconnections) to obtain a low threshold voltage for both types of transistors.

I was asked to design a frequency divider in this new process. A circuit based on transmission gates had been presented at ISSCC'67<sup>4</sup>, but its topology was not suited to group P and N channel devices in two distinct areas. Instead, I carried out a direct Huffman synthesis of the divide-by-two function. The resulting logic equations were implemented by means of two 2-level gates and two inverters (Fig. 12), requiring a total of 16 transistors (after merging some transistors). I drew the layout, and Fritz successfully fabricated the circuit. We measured a current consumption of 10nA/kHz at 1.35V, about 20 times less than the bipolar dividers used in Beta 21. Since the maximum frequency at this low voltage was still about 200kHz, it became possible to increase the frequency of the oscillator and thereby reduce the size of the quartz resonator. This frequency was later fixed at 32kHz and is still used nowadays in most electronic watches.



### Fig. 12: First low-voltage CMOS frequency divider.

In 1970, Fritz and his team had developed a new CMOS process in which the lightly doped p-well was obtained by solid-to-solid diffusion of boron in a closed capsule. Standard aluminium gates turned out to be sufficient to obtain a threshold voltage of about 1 volt for both types of transistors. Our first CMOS LSI

watch circuit was integrated in this new low-voltage process.

This experimental circuit included the first realization of digital frequency tuning. The goal was to eliminate the trimmer capacitor. Instead of adjusting the frequency of the quartz oscillator, the basic idea was to adjust the ratio of the frequency divider. The oscillator could then be optimized for a single frequency, its stability was not degraded by a trimmer capacitor and less precision (but no less stability) was required for the quartz.

We found that the most practical solution was to implement an inhibition circuit between the oscillator and a fixed-ratio divider (Fig. 13)<sup>5</sup>. In this scheme, the frequency of the oscillator is slightly increased, so that its exact value due to spreading is always larger or equal to  $2^{15}$  Hz. Fine-tuning is obtained by eliminating the adequate number of pulses delivered by the oscillator within each adjustment cycle, before they enter the divider. With 20 binary stages, the adjustment cycle was 1/32s and the average frequency could be adjusted to within 1ppm. The inhibit circuit was controlled by a 7-bit word, providing 128 ppm of adjustment range.



Fig. 13: Principle of digital tuning and its first experimental on-chip implementation.

Of course, the adjustment word must be specific to each device, since it depends on the exact frequency of the oscillator. It must therefore be stored in a memory. The content of the memory must be kept when the battery is changed, but no  $E^2PROM$  was available at that time. Hence, for this first experimental system called Beta 3, the memory was a set of very simple small switches. But various other solutions were later evaluated, until CMOS compatible  $E^2PROM$  became finally available.

The whole circuit (that did not yet include the oscillator, because CMOS was still considered too crude to implement analog circuits!) contained approximately 500 transistors. Since no computer aid was available, the layout was drawn with colour pencils on a large sheet of transparent graph paper (on the back side of the paper to avoid removing the frame when erasing for corrections!). The layout scale was 1000:1 for the various blocks and was 200:1 for the overall drawing.

Layout rules had to be carefully respected at each step of the drawing, since remaining errors required the complete erasing of large layout areas.

No computer assisted the mask making either. The first step was to report each mask on a special sandwich of red and white plastic called Rubylith. The red layer was manually cut on a coordinatograph according to the pattern, and manually peeled. Although all divider stages were identical (with only one layout), their cutting and peeling had to be repeated 20 times. Most critical was the mask for the contact holes. For some reason, the Rubylith had to be a negative so that each of the 1000 contact holes (2 per transistor since this was a metal-gate process) corresponded to a 2 by 2 millimeter square of red layer left on the white background. They had a natural tendency to peel away by themselves, so the final control was a real nightmare! But the masks were successfully finalized, and the circuit did work at first silicon, with a current consumption of 0.45µA at 1.5V.

It should be noticed that the chain of divider stages is asynchronous, with the advantage of consuming just a little less than twice the first stage alone. But the timing of the inhibit signal fed back from the end of the divider is totally random with respect to that of the pulses delivered by the oscillator. Therefore, the inhibition circuit was designed to operate properly, independently of the order of transition of these two logic signals. However, this turned out to be insufficient. Indeed, detailed measurements made by the customer of a subsequent implementation showed that the circuit stopped operating correctly just within a very narrow range of temperature (less than 1 degree Celsius). The problem was traced back to the fact that, at this particular temperature, the transitions were exactly simultaneous. This unexpected situation had not been considered in the synthesis of the inhibition circuit, which was later corrected. This example illustrates the difficulty of implementing asynchronous circuits, even simple ones. It also shows the necessity to analyze them in detail, and not only simulate them.

The first industrial application of digital tuning was developed for Complication SA (owner of the Piaget brand), with a special 524kHz resonator developed at CEH by Jean Hermann. Since no non-volatile memory was available yet, it was decided to store the adjustment word in a RAM and to extend it to 14 bit. We had developed a new system, called Beta 4, in which the adjustment was made automatically by the watch. Three pulses separated by exactly one second were injected into the watch by magnetic coupling. This reference was used by the watch circuit to generate the adjustment word. I can still remember the semi-discrete experimental system covering the whole surface of my test bench, including two elementary coils providing the coupling (but I have unfortunately made no photograph).

The final product came to the market in 1975 and included a control box (available at the retailer's place). The watch was placed on this box before pushing a button, and two seconds later its frequency was adjusted within 1ppm. A special very small size battery had been added to keep powering the RAM while the main battery was changed. As technical designers of the systems, we were disappointed to discover that the company never advertised its originality. "You do not want to look inside the belly of a beautiful girl" was their answer!

In the early 70's, we kept working on improving the most basic blocks of the watch, the frequency divider. It was not yet clear whether low-voltage CMOS circuits could be produced in large quantities. Thus, with my colleague Jean Fellrath, we started exploring the possibility to use purely digital binary dividers based on BJT NAND gates. Since the most simple known configurations required 6 gates, I built a specialized computer based on TTL circuits, which was searching all possible configurations of n gates for a possibly simpler divider-by-two. The machine found a 4-gate structure that just required the control of a race between two gates<sup>6</sup>. We were trying to implement it by means of the recently published Integrated Injection Logic (I<sup>2</sup>L) when the project was abandoned in favour of CMOS solutions. And I later found my "new" circuit in the literature!

Our existing CMOS divider cell was driven by complementary input variables and was therefore sensitive to a possible race between them. With my collaborator Walter Hammer, we decided to search for race-free structures (meaning that no two gates transit simultaneously), with the goal of increasing the robustness of the frequency divider and possibly reducing its power consumption. It also meant that such structures would have a single input (or would be what was later called "single-clock circuits"), and that correct operation would be ensured independently of the relative speed of the various gates.

But at first we had to give a precise definition of a logic gate, the elementary building block of all sequential circuits. We came to the following definition<sup>7</sup>: a logic circuit is a gate if, and only if, it can be modelled by a single-output delay-free combinatorial circuit followed by an inertial delay (that filters out any pulse shorter that the delay value). A gate has therefore no internal memory. A two-level gate can be implemented in CMOS by means of series/parallel combinations of transistors. But it ceases to be a gate if one input is delayed by more than the output delay (such an input delay might be due to a higher threshold value, or to some delay in the input connection). An inverter is of course always a gate, but a cascade of 3 inverters cannot be considered as a single gate.

Using this definition, I carried out a computer synthesis of all race-free divider blocks possible with a

given number of gates. The computer was the IBM 1130 of the University of Neuchâtel that was fed by punched cards. I was coming in the evening with my stack of cards, retrieving the result the next morning. It took me many days until my FORTRAN program was debugged, but finally I got the results, which is still valid to-day: only one 4-gate race-free divider-by-two is possible whereas 9 solutions exist with 5 gates. The best of these 10 possibilities is the 5-gate structure illustrated in Fig. 14.



Fig. 14: Race-free divide-by-two cell.

The five gates and their interconnections are defined by the five equations (or rather logic implications). The input variable is I, each internal variable Ato E is produced by a gate (D and E by simple inverters), and the gates are interconnected according to the set of equations. The corresponding graph of transitions shows that no more than one variable tends to transit from any given state; hence there is no race between variables. The sequence of stable states shows that each variable transits at half the input frequency and may thus be used as output of the divider stage. This circuit was integrated experimentally in a 5µm silicon gate process that had been developed in the meantime<sup>8</sup> (Fig. 15). At 1.35V, it was consuming only 1.2nA/kHz, thus about 10 times less than the previous realization. A maximum frequency of 2MHz made it possible to use higher quartz frequencies for special products, such as the Beta 4 mentioned previously. The process itself was running on a pilot line, producing a limited quantity of industrial circuits.

The asynchronous divide-by-two cell is the most elementary non-trivial sequential circuit, and computer synthesis of race-free solutions was not applicable to more complex cells. But our younger colleague Christian Piguet later developed a methodology applicable to various types of cells, including D and JK flip-flops<sup>9</sup>.

Returning to 1972, an important contribution was brought by my senior colleague Henri Oguey. After



Fig. 15: First integrated race-free frequency divider.

developing display motors, Henri had been very active in the industrialization of Beta 21. Returning to circuit design, he joined our project on new CMOS frequency dividers. He soon pointed out that among all the transistors of a sequential circuit, only some are active to change the output state of each gate, by charging or discharging the output capacitor. The others are just needed to maintain established states against leakage currents. They are therefore not necessary if the frequency is high enough.



Fig. 16: Transformation of the static circuit of Figure 14 into its dynamic version. Each transistor is represented as a circle, with the name of the variable driving its gate. Only transistors shown in red are needed in the dynamic divider.

I formalized the idea<sup>10</sup> and immediately tried it on my new divider. I pulled the non-necessary transistors out of their sockets in my breadboard simulator.... and the divider kept working. As illustrated in Fig. 16, the static circuit of 22 transistors was transformed into a dynamic version of only 10 transistors (the inverters were not needed any longer, since the transistors driven by them had been removed). Since the total capacitance to be switched was reduced, the power consumption was reduced by about one half. The photograph shows that the dynamic circuit (D/2) is about half the area of the static version (S/2). A dynamic divide-by-three cell (D/3) proposed by Oguey<sup>11</sup> is also shown.

A semi-dynamic version was also developed by exploiting the fact that the short duration of the input signal could be propagated down to low frequencies by using *A* as output variable driving the next stage. I later carried out a systematic synthesis of race-free dynamic divide-by-*N* blocks up to N=6. We then extended the idea of race-free dynamic CMOS to other logic blocks, including D, T and JK flip-flops.

### **Pioneering Weak Inversion for Analog CMOS**

Most of a watch chip area is occupied by digital circuits. But the most critical part, the quartz oscillator, is indeed an analog circuit. To respect the power limitations, transistors had to be biased at unusually low current levels below  $1\mu$ A. The result of my first measurement of a MOS transistor at very low current is shown in Fig. 17.



Fig. 17: First measurement of a MOS transistor at very low current (annotated copy of my notebook).

This was in April 1967. The transistor was a P-channel device fabricated in 1966 by Fritz Leuenberger, with a threshold of about 1 volt. I was deeply surprised to discover that, when the gate voltage was reduced below its threshold  $V_{\rm T}$ , the transfer characteristics were nicely exponential across more than 5 decades of drain current. There was no explanation in the literature, since the transistor was always characterized by a square law behaviour. So we painfully started trying to model this strange behaviour. With my colleague Jean Fellrath, we were struck by the similarity with the BJT and very excited by the possibility to implement analog schemes developed for BJT's, but with a device needing no control current.

## **TECHNICAL LITERATURE**

Jean started with a known current reference (Fig. 18a), in which the size ratio K between two N-channel transistors is compensated by the building up of a voltage across a resistor. On this basis, I developed an amplitude-controlled quartz oscillator (Fig. 18b). In this circuit, the oscillation voltage at the gate of the active transistor of the oscillator is filtered out (by a non-critical RC filter) at the gate of the K-time larger device, whereas both transistors share the same DC component. As the amplitude of oscillation increases, the bias current delivered to the oscillator decreases, until equilibrium is reached. Later, the resistor shown in green line was added to limit the start-up current, and the oscillator was separated from the regulator for more flexibility. I am still very proud of this circuit that, with these modifications, has become a standard in watch oscillators.



Fig. 18: Analog circuits exploiting weak inversion: (a) Current reference borrowed from BJT design. (b) Amplitudecontrolled quartz oscillator.

While we were developing and testing other circuits exploiting these exponential characteristics, several authors were publishing models to describe this "weak inversion" or "sub-threshold" behaviour of MOS transistors (that appears as a DC leakage current in CMOS digital circuits). Most remarkable was the paper by Swanson and Meindl<sup>12</sup>. Indeed, not only did the authors bring important improvements in modelling weak inversion, but they also showed that digital CMOS circuits could operate in weak inversion at a supply voltage as low as 200mV. But the corresponding maximum frequency was so low that the idea was forgotten for the next 30 years!

We put together these various publications, and developed our own simple model that was directly applicable to hand-design (Fig. 19). The width-tolength ratio was originally not included inside current  $I_{\rm D0}$ . This current was later decomposed in the EKV model, showing its exponential dependency on the equilibrium threshold voltage  $V_{\rm T0}$ . This model includes several features that were kept in later developments leading to the modern EKV model: the source-drain symmetry is preserved by defining the various voltages with respect to the (local) substrate, and a slope factor n is introduced to characterize the reduced effect of the gate voltage. It also emphasizes the similarity with BJT's. Indeed, it corresponds to the Ebers-Moll model of a BJT with no base current. But the MOS transistor is a 4-terminal device, with the gate voltage controlling what would be the specific current of the BJT.



Fig. 19: Model of the MOS transistor in weak inversion.  $U_T = kT/q$ , *n* is the slope factor and  $V_{TO}$  is a bias-independent threshold voltage.

The model and our various circuits were presented at ESSCIRC'76<sup>13</sup>, followed by an extended version in our seminal paper of 1977<sup>14</sup>. I still remember that, after giving the ESSCIRC paper, a comment from the floor did seriously question the reliability of analog circuits based on the "leakage current" of transistors. Of course, weak inversion is not a leakage current. It is a well-controlled mode of operation that is only slightly dependent on process parameters. Circuits operating in weak inversion have since been produced by hundreds of millions for applications in watches.

As already mentioned, all circuits were simulated on a breadboard, after scaling up the values of circuit capacitors to render those of the breadboard negligible. This approach posed a special problem for simulating quartz oscillators, because no electrical circuit could simulate the very high quality factor Q of the resonator. While debugging and optimizing the oscillator of Beta 21, I had developed a special technique to solve this problem. It was based on the fact that with such a high Q, the exchange of energy between the circuit and the resonator can only take place at the fundamental frequency. The nonlinear circuit, including the non-motional part of the quartz resonator, was measured by injecting a sinusoidal current. The resulting voltage was filtered (by means of a band pass filter precisely tuned to the measurement frequency) to obtain the amplitude and phase of its fundamental component. The result was then divided by the value of the injected current to obtain the circuit impedance for the fundamental frequency  $Z_{c(1)}$  (Fig. 20).



Fig. 20: Splitting of a quartz oscillator for precise simulation of amplitude and frequency. Stable oscillation is obtained for  $Z_{c(1)} = Z_m$ <sup>[15]</sup>.

This is a very powerful technique<sup>15</sup>, capable of simulating not only the amplitude of oscillation, but also the precise amount by which the circuit is pulling the frequency of oscillation away from the (series) mechanical resonant frequency of the quartz. It is still very useful today, since it can be applied to computer simulation as well. It provides much more insight and precision than the standard lengthy time simulation.

An example of full-blown version of watch oscillator with very low power consumption is illustrated in Fig. 2116. This circuit was fabricated in a bi-doped polysilicon process: in order to minimize the threshold voltage of both types of transistor, the polysilicon gate layer was p-type for P-channel transistors, and n-type for N-channel devices. Hence, a lateral diode appeared at each transition inside the polysilicon layer (normally, it had to be short-circuited). The first application of this diode was proposed by Henri Oguey: he used its large leakage current to maintain some logic states in dynamic circuits, in a scheme called resistance-CMOS circuits<sup>17</sup>. In this oscillator, they were used either as floating diodes, or to replace high-value resistors, by using quads of diodes to obtain symmetrical voltage-current characteristics. As a result, no real resistor was needed, and the total current drain was less than 100nA at 32kHz, almost constant for a supply voltage ranging from 0.8 to 3 volts. The regulator itself consumed only 5% of this current.





In 1975, I started the first course on integrated circuit design at EPFL in Lausanne. I specially emphasized low-power devices and circuits, thereby initiating a culture that has been pursued and expanded by several of my students. CEH had introduced a multicircuit chip program (a precursor of multi-project wafers) that was running every three months, so my students could design small circuits and measure them after integration.

Besides the small pilot line of CEH, low-voltage CMOS circuits were produced by Faselec AG in Zurich, and by EM Microelectronic Marin SA, close to Neuchâtel.

# Further Developments in Low-Power CMOS Devices and Circuits

The work on most basic circuits for the watch was terminated around 1977, but an important step still had to be made: the watch microprocessor. I must confess that, as the head of CEH circuit design activities, I was not convinced of the real need for such a development. But a workgroup was created, including partners from EPFL, from the University of Neuchâtel and from the watch industry. This group, led by my young collaborators Christian Piguet and Jean-Felix Perotto, started working on the idea of long instructions, to minimize the rate of memory access. This was the beginning of an original series of low-power microprocessors especially designed for watches (see the article by C. Piguet in this issue). The flexibility offered by the approach soon rendered it essential, and nowadays all electronic watches include a special microprocessor.

The watch also needed some auxiliary circuits. Among them was a circuit detecting the end of the battery life. Based on a voltage measurement, it required a precise voltage reference on the CMOS chip. We developed our first band gap reference by combining the base-emitter voltage of the vertical BJT available in CMOS with a PTAT voltage produced by MOS transistors in weak inversion<sup>18</sup>. Another reference imagined by Oguey was again exploiting the bidoped poly process. It was using the voltage difference of about one band gap between the threshold voltage of two N-channel transistors with opposite types of gate doping<sup>19</sup>.

At that time, our low-power CMOS circuit research had been extended much beyond traditional watch circuits, this for two main reasons. One was the need to serve customers outside the watch industry. The other was our dream to introduce many other functions in the watch. Our reasoning was that since the watch occupies a unique position on its owner's arm, it could and should provide several useful functions or services. Many possibilities were proposed: calculator, electronic money, electronic key, communication device, electronic compass, altimeter, personal data storage, and physiological check of the wearer. Even though some of these additional functions have later been introduced in special watches (including a GPS receiver that was not yet conceivable in the midseventies), this dream of a multifunctional watch never came to reality. The main technical reason is the difficulty to manually enter information in a small device such as a watch. Not that we did not try: a variety of input devices were developed and tested, essentially based on capacitive sensors implemented on the watch glass. Another reason is that a watch is considered a piece of jewellery rather than a technical device (unlike today's portable phones).

We were given a lot of freedom to explore a wide variety of low-power building blocks. In this framework, our colleague Henri Oguey proved to be particularly creative: his 26 notebooks are a mine of novel ideas, most of them unpublished or even not tested. A good example is the switched current mirror that he imagined in 1977 (Fig. 22). The explanation in French in this figure can be translated into "obtain the same effect as with two perfectly identical transistors". For some obscure reason, we did not apply for a patent, but I was excited by the idea and I made a test circuit in one of our multi-circuit chips. This circuit remained unmeasured (lack of time?) till the mid-80's, when I gave it to my student George Wegmann, as a starting point for his Ph. D. thesis. George made an excellent work, integrating an optimized version, and proving by extensive measurements that such a mirror could reach a precision as good a 0.1%<sup>20</sup>. At the end of 1988, he was ready to publish his results when Daubert et al. published the principle that they named a current copier<sup>21</sup>.

|  | Principe : détenir le menn<br>tos perfactement identiques. |      |   |   | elks | ge 'are | deux | transis |
|--|------------------------------------------------------------|------|---|---|------|---------|------|---------|
|  | 111                                                        | È,   | C | [ | d    |         |      |         |
|  | OUT                                                        | I.g. | e |   | 1-+  |         |      |         |

Fig. 22: Excerpt of Oguey's notebook dated May 1977.

Discussions with Henri Oguey were also at the origin of a new scheme for accurate compensation of offset in amplifiers<sup>22</sup>. In this scheme, an offset compensation voltage is stored, not at the main input of the amplifier, but at an auxiliary input of lower sensitivity. The effect of charge injection can thereby be reduced by a large factor.

After the idea of switched capacitor (SC) circuits was first published in  $1977^{23}$ , we applied it to very low-power circuits. My first realization was a quasi-sinusoidal SC oscillator, intended as a VCO for a tracking filter, and consuming less than  $50nA^{24}$ . As I can

remember, the framework was the search for a possible watch-to-watch communication, using a very narrow bandwidth to minimize power. We later developed several versions of low-power SC circuits, in collaboration with EPFL<sup>25, 26</sup>. I demonstrated that operational amplifiers with low output impedance should be replaced by OTA's in order to minimize noise<sup>27</sup>. By operating the transistors in weak inversion, their intrinsic voltage gain was maximum, so that more than 100dB of gain could be obtained in a single cascoded stage (although a cascode is indeed a very special 2-stage configuration). Hence no phase compensation was needed and the architecture of the OTA was very simple. But slew rate limitations were aggravated by the low bias current level, and we had to develop some schemes for class AB operation<sup>28</sup>. The principle of one of our new schemes is illustrated in Fig. 23<sup>29</sup>. The tail current of the input differential pair is increased proportionally to the difference of its two output currents. For a proportionality factor  $D \ge 1$ , the differential transconductance increases with the input voltage, instead of decreasing to zero as in a normal differential pair (D=0).



Fig. 23: Principle of a class AB amplifier and transfer characteristics in weak inversion.

At low supply voltage, the problem of charge injection by switches (also called clock feedthrough) was particularly serious. In order to be able to reduce it by a compensation technique, I carried out the first detailed analysis of this important problem. This analysis, based on an equivalent circuit, was originally only presented at a course in Leuven <sup>27</sup> (and also published in an obscure journal). It was later confirmed by other authors<sup>30</sup>, and by detailed measurements in the framework of our work on switched mirrors at EPFL<sup>31</sup>.

Latch-up was an important problem in the early time of CMOS. We discovered it accidentally, before it had been reported in the literature. My colleague Jean Hermann was developing a new type of quartz resonator, and he wanted to carry out long-term measurements inside an oven. For this purpose, I gave him one of my recently developed oscillator chips, powered by a big 10Ah battery. After a few days, he called me: the oscillator had stopped and the battery was empty. I took the system back to my lab and measured it with a new battery; it was working perfectly, so I send it back to him. This cycle was repeated several times. While I was once more measuring the circuit, the current suddenly jumped by 4 orders of magnitude when I switched on my big oscilloscope. It returned to its microampere level after switching off and on the supply voltage. Although we had stopped designing bipolar circuits, I had always kept an interest in the BJT. So I was quickly able to identify a thyristor effect. A model made it easy to understand what should be done to avoid the problem. Latch-up could easily be eliminated in sub-microampere circuits, such as watch circuits: A minimum latch-up current of about 1mA was guaranteed by applying adequate layout rules, and a  $2k\Omega$  poly resistor was placed in series with the 1.5V battery, limiting the maximum current below this value.

My sustained interest for the BJT made me suggest to operate a standard MOS transistor as a lateral BJT (which looks obvious, at least *a posteriori*, given the structural similarity of the two types of devices). After negative results obtained by some collaborators, I was not discouraged and decided to measure it myself. I took care to consider it as a 5-terminal device (the 4 terminals of the MOS transistor in its well, and the substrate), and it worked as expected. My first application was a multiple cascoded current mirror shown in Fig. 24. The same devices could be operated as MOS transistors or as lateral BJT's, to demonstrate the drastic improvement in precision. I also demonstrated a band gap voltage reference and a low-noise amplifier<sup>59</sup>. Figure 25 shows a later measurement of 1/fnoise, in which the decreasing gate voltage  $V_{G}$  is compensated by a more negative source voltage  $V_{\rm S}$  to maintain the drain current  $I_D$  constant (at 1µA). Operation is therefore progressively moved from MOS to BJT, showing a dramatic reduction of flicker noise.



Fig. 24: Multiple cascode current mirror. Each inner concentric device can be configured as a MOS transistor or as a lateral BJT.

Weak inversion provides several advantages that can be exploited in low-power circuits. But this mode of operation also has some drawbacks, the most important being the poor precision of current mirrors. Hence, even in very low-power circuits, the various



Fig. 25: Flicker noise measurements (from Stephan Cserveny, unpublished).

transistors must usually be biased with various degrees of inversion, depending on their function. This is the reason why we needed a model covering all levels of current, from weak to strong inversion. In collaboration with Stefan Cserveny, Henri Oguey developed a model called CEMOS that was an important step towards our later EKV model. They introduced the notion of control voltage (that became the pinch-off voltage in EKV). They also emphasized the symmetry of the transistor by expressing the drain current as the difference of two values of the same function, one calculated at the source, the other at the drain (these two values became the forward and reverse components of drain current in the EKV model). The function itself was changing smoothly from the exponential of weak inversion to the square law of strong inversion, by means of a mathematical interpolation.

In 1983, the CEH research laboratories were merged into CSEM (a French acronym for Swiss Centre for Electronics and Microtechnology), a newly founded organization partially supported by the Swiss government. During its more than 20 years of existence, CEH had provided a very open research environment with a lot of research freedom. The definition of most of the projects was sufficiently general to allow the researchers to explore new ideas inside wide domains. I like to compare this kind of exploration with the exploration of unknown territories of the World. We were exploring in particular the continent of very low-power circuits (we called them micropower circuits), with no predefined milestones (what are milestones in the exploration of a terra incognita?). The cafeteria was always a place of intense technical discussions and exchanges between designers and process specialists. We were given the time and the freedom to generalize our ideas and to conceptualize them (a very important step to progress in research). Of course, there were dead ends, but they often could be transformed into new ideas applicable elsewhere. The lack of computer resources forced us to use analytical approaches, which give much more insight than computed sets of numbers or plots (the computer is of course an invaluable complementary tool, but should not replace the analysis as is does too often nowadays). Overall, this CEH period was a fantastic adventure for all of us.

### Low-Power Circuits at EPFL and CSEM

CSEM was more development-oriented, with welldefined specifications to be reached in a well-defined period of time. I moved more of my personal research to EPFL in collaboration with my Ph.D. students.

A very imaginative student, François Krummenacher had independently solved the problem of sensitivity of SC circuits to parasitic capacitors during his Diploma work. He later developed a low-power SC filter approach with intrinsic double correlated sampling to reduce the 1/f noise<sup>32</sup>, and his Ph.D. thesis was about the optimisation of very low-power SC filters. As a first assistant, he later developed along the years a multiplicity of novel low-power circuits, including continuous-time filters<sup>33</sup>.

François also helped me supervise the Ph.D. project of Christian Enz on high-precision CMOS micropower amplifiers. It is in this framework that previous developments of a MOS model at CEH were pursued and extended at EPFL. This new model was later called EKV, after its first publication by the three of  $us^{34}$ . At this stage, the model was already including the dynamic behaviour and the noise, but it still used the mathematical interpolation from weak to strong inversion proposed by Oguey and Cserveny. The model was then extended year after year with the help of several students, under the leadership of Christian Enz, who had become professor. Christian also started an activity on low-power low-voltage RF CMOS circuits. Transceivers were developed that could operate below 1V of supply voltage by biasing transistors close to or in weak inversion<sup>35, 36</sup>. His team also demonstrated new log domain filters based on the exponential characteristics in weak inversion<sup>37</sup> (see the article by C. Enz in this issue).

Although CSEM was more oriented towards development and industrial projects, we kept several advanced research projects, in particular in lowpower integrated circuits. Among them was a project on analog VLSI inspired by biology. The underlying idea was the following. It can be demonstrated that analog remains more power efficient than digital for carrying out tasks requiring no large signal-to-noise ratio<sup>38, 39, 40</sup>. This is the case for tasks of perceptive nature, like vision or audition. Instead, as our brain does, they require a massively parallel system of strongly interconnected cells to carry out collective computation.

Thus, analog VLSI appeared best suited to implement, in particular, low-power image processing systems-on-chip (often called somewhat incorrectly "artificial retinas"). One important problem was (and still is) the density of interconnections that is much lower

on-chip than in the brain. One solution is to implement connections to the neighbouring cells only. This was possible for the motion detection chip shown in Fig. 26, that we developed for a pointing device<sup>41</sup>. This circuit was inspired from the rabbit's eye, and evaluates separately the vertical and horizontal motion components of a pattern of dots.



Fig. 26: Layout of a motion detection chip for pointing devices.

For more general communication the solution was to use the scheme called address-coding events (ACE)<sup>42</sup>. According to this scheme, each cell produces very short pulses (called events) that code their position on a common parallel bus. The analog information is carried by the frequency or by the phase of these events (both of them being continuous values). Different solutions may be used to deal with the possible collision of these asynchronous events. At EPFL, we developed our particular scheme that simply eliminates the results of collisions<sup>43</sup>. This scheme was applied recently by CSEM to a remarkable analog vision-sensor chip that computes at the pixel level the contrast magnitude and direction of image features<sup>44</sup>. For each pixel, this pair of analog values is transmitted on two separate buses as the phase of ACE's with respect to a common clock signal.

The whole project in bio-inspired analog VLSI was also intended to force the exploration of totally novel schemes by a kind of lateral thinking. One result was the idea of Oliver Landolt to represent and compute analog data by means of "place coding"<sup>45</sup>. This approach should be explored further, since it combines the advantages of digital and analog circuits: it allows one to increase the computational accuracy (and the immunity to many kinds of perturbations) of analog data by increasing the number of hardware cells.

Another by-product of bio-inspiration was the con-

cept of pseudo-resistors. According to this concept, illustrated by Fig. 27, any network of linear variable resistors can be replaced by a network of MOS transistors in weak inversion if only currents are considered<sup>46</sup>. The value of the pseudo-resistor may be controlled directly by the gate voltage of transistor T, or by the current  $I_c$  in an associated control transistor T<sub>c</sub>. All transistors may share the same voltage  $V_{ref}$ . If a node (e.g. node B) is grounded in the resistive network, it corresponds to a saturated transistor in the pseudo-resistive network (side B of T saturated), according to the concept of pseudo-ground.



Fig. 27: Equivalent linear networks with respect to current. (a) Resistor R between nodes A and B of a network of variable resistors. (b) Corresponding pseudo-resistor T in weak inversion.

This linearity with respect to currents can be traced back to the fact that diffusion (which is the sole cause of channel current in weak inversion) is a linear process. But the principle is also applicable if the transistors leave weak inversion (the current is then no longer carried only by diffusion), provided all of them share the same gate voltage, as was first demonstrated by Bult and Geelen in 1992477. I formalized the concept in 1997<sup>48</sup> while describing a number of possible applications. I am still fascinated by this property of current linearity that was identified only 30 years after the MOS transistor was first used. This property is linked to the fundamental symmetry of the transistor, as expressed by the EKV model. In weak inversion, this symmetry is only progressively affected when the channel is shortened, whereas structural non-homogeneities along the channel also affect it in moderate and strong inversion<sup>49</sup>. But this functional symmetry is otherwise always independent of the shape of the channel.

### Low-Power, Low-Voltage Today

All of the essential CMOS watch building blocks that have been developed in the past have been adapted to more recent fabrication processes, and they are still used in modern Swiss watches. They include the oscillator, the frequency divider and the digital tuning with an on-chip E<sup>2</sup>PROM. The experience on lowpower quartz oscillator circuits is directly applicable to MEM resonators that have the same equivalent circuit. Special microprocessors have evolved with the increasing complexity of the watch and are present in all modern electronic watches. Special devices such as the bipolar-operated MOS or the lateral diode inside the polysilicon layer are available today in all or some modern sub-micron processes, but they must be characterized before being used.

The modern version of the EKV transistor model is fully charge-based and continuous from weak to strong inversion. It includes second order effects related to submicron processes and is applicable to RF IC design as well as to very low power<sup>50</sup>. Although it has lost the race against the PSP model<sup>51</sup> to become the new standard, it is implemented in many CAD tools<sup>52</sup>. Based on the physical mechanisms underlying the transistor behaviour, it only requires a limited number of parameters that can be predicted from the process, and it maintains the symmetry inherent to the device (which is very useful, if not absolutely necessary, to understand and analyse some analog circuits). It provides a very precise " $g_m/I$ " (transconductance-to-current ratio as a function of the normalized saturation current) relationship that is a very useful tool for optimum analog design. It is also coherent from its simplest form that can be used in understanding and teaching circuits, to its most detailed form applicable in CAD tools.

At the beginning of the 90's, the World has awoken to low power. First because of the increasing importance of portable devices, but also to limit onchip heat generation and to reduce the cost of power supplies in computers. Indeed, power consumption has been moved high in the list of chip specifications (from which it was previously often simply absent). Moreover, with modern submicron processes, the maximum voltage must be limited to limit electric fields. Hence, low-voltage has also become very important nowadays. Circuit techniques developed along the years for low power and low voltage have therefore gained more importance.

For analog subcircuits, a voltage reduction does not help reducing their power consumption<sup>40</sup>. Worse, and especially below 1V, it tends to increase the minimum power needed to achieve a given speed and a given signal-to-noise ratio. One reason for this is the reduction of maximum voltage amplitude due to the drain to source saturation voltage  $V_{\rm DSsat}$  of MOS transistors. The amount of channel inversion must therefore be reduced to reduce  $V_{\text{DSsat}}$ , which reaches its minimum in weak inversion. As a consequence, in any analog circuit operated below 0.5V, transistors must be biased close to or in weak inversion. This connection of weak inversion with low voltage is more fundamental than that with low current (since transistors can in principle remain in strong inversion even at a very low-current if their length-to-width ratio is increased). The special characteristics in weak inversion permit a variety of low-power, low-voltage analog circuits, including translinear loops, log domain filters and analog processors exploiting the concept of pseudo-resistors<sup>53</sup>. A recent experimental application of the latter is the on-line minimization of the total energy spent by a multiprocessor system-on-chip to execute a set of related tasks<sup>54</sup>.

The idea of single-clock digital circuits has been rediscovered after 25 years for application to very high-speed circuits<sup>55</sup>. For digital circuits, power can be reduced by reducing the supply voltage, thereby reducing the energy per transition. The resulting reduction in speed can be compensated by using a smaller feature size and/or by resorting to parallelism<sup>50</sup>. But to maintain a sufficient gate voltage overhead (in order to maintain high speed), the threshold voltage must be reduced. As a result, at zero gate voltage the transistor is no more completely blocked since some non-negligible weak inversion current remains. Often called "DC leakage current", this residual off-state current tends to become an important part of the total power consumption. During its first 30 years of existence, the MOS transistor could be considered as an ideal on-off switch, and CMOS logic was only consuming energy during transitions. The power consumption of idling gates was negligible, thus their percentage did not affect power consumption.

Nowadays, the MOS transistor must be looked at as a *voltage-controlled current modulator*, with a limited on/off current ratio. Since the maximum speed is proportional to the on-current, the off-current and the resulting DC power consumption becomes proportional to the required speed. But this DC power consumed by a gate is also proportional to the idling time of this gate. It can therefore be reduced by increasing the duty factor  $\alpha < 1$  of the gates, defined as the ratio between their effective switching frequency *f* and its maximum possible value  $f_{max}$ .

Now, for a given supply voltage (hence a given gate voltage swing), the maximum on/off current ratio is obtained in weak inversion. Thus, as I have shown in a recent analysis<sup>57</sup>, CMOS logic circuits in weak inversion can be optimized to reach the ultimate minimum of power *P* for a given switching frequency and with a given process. This minimum is shown in Fig. 28 together with the corresponding optimum supply voltage  $V_{\rm B}$ . The maximum possible frequency  $f_{\rm max}$  can be selected by adjusting the off current according to the model of Fig. 19, by means of the source-substrate voltage  $V_{\rm S}$ . The minimum power depends on the process via the total load capacitance *C* of the gate (including interconnections). It splits into a dynamic component and a static component.

Compared with 1 Volt operation (for which the static power would be negligible), a power reduction by more than 50 would be possible if a duty factor  $\alpha = 1$  could be ensured. But this is only possible with 3-stage



Fig. 28: Minimum power consumption P of a CMOS gate operated in weak inversion (red curve) for an average frequency f of on/off transition cycles (not to confuse with the clock frequency). Corresponding optimum supply voltage  $V_{\rm p}$  (black curve).

ring oscillators (that are not very useful circuits). For a more realistic value  $\alpha = 1/100$ , the reduction can still be more than a factor 10, with a maximum possible switching frequency of about 3MHz for a 0.18 micron process (which would reach 500MHz for  $\alpha$ =1).

This ultimate limit assumes that the off-currents of p- and n-channel transistors are adjusted to the same value (by adjusting the source-substrate voltage, which requires a true twin well process). It might not be reached in practice due to several side effects like mismatch or drain induced barrier lowering. The new century has seen the emergence of a new interest for weak inversion logic (now called subthreshold logic) with the realization of digital sub-systems operating at a clock frequency of a few kHz and a supply voltage of just a few hundred millivolts<sup>58</sup> (see article by J. Kwong and A. Chandrakasan in this issue). There is no doubt that further research in this direction (and further process scaling) will permit an increase in system speed. But, at ambient temperature, it will never be possible to decrease the supply voltage much lower (the absolute minimum for regenerative logic being  $2\ln(n+1) \cdot U_{T}$ ). It is interesting to point out that these voltage levels are comparable to that of the action potential in the brain.

The human brain is a fantastic computing machine. With its 10<sup>11</sup> neurons (or 10<sup>12</sup> cells if we include the supporting glial cells), it consumes about the same power as a fast modern microprocessor. But its processing power is immensely larger. It must therefore use much more energy efficient computation strategies. Some of them are known. A massive parallelism compensates for the much lower speed of each neuron (maximum firing rate of about 2kHz). But it also makes possible the spatio-temporal representation and processing of information, by means of arrays of cells organized in maps. The system may be seen as digital, since it uses short pulses of fixed shape and amplitude (called the action potential) to carry the signal. But it is essentially analog since the information is carried by the frequency and by the

phase of these pulses. Its program and data memory is distributed and is stored in the pattern of interconnections and in their variable strengths (synaptic weights). Its learning capability has inspired the approach of artificial neural networks (with interesting but limited results, due probably to an inadequate adaptation to the silicon environment). But what is known today is certainly a very small part of all the unusual schemes exploited by the brain, and new ones are being uncovered by very active teams of neurophysiologists.

Reducing the power consumption of processing chips while increasing their complexity will be possible by further developing traditional techniques, at all levels from process to software. But there will be limits, beyond which totally new approaches will be needed. Why not then try to borrow some of the schemes used by the brain, thereby harvesting the results of half a billion years of evolution? Undoubtedly, these schemes will have to be cleverly adapted to the constraints and possibilities of the silicon environment. But this adaptation will only be possible if a sufficient number of openminded deciders in the industry and in universities organize a close collaboration of their most inventive circuit designers with the community of brain researchers.

- 1 H. J. Oguey "The first quartz wristwatch," to be published in IEEE Spectrum.
- 2 F. M. Wanlass and C. T. Sah, "Nanowatt Logic Using Field-Effect Metal Oxide Semiconductor Triodes," ISSCC'63 Digest of Technical Papers, pp. 32-33, 1963.
- 3 F. Leuenberger and E. Vittoz, "Complementary MOS lowpower low-voltage integrated binary counter," Proc.IEEE, vol.. 57, No 9, pp 1528-1532, Sept. 1969.
- A. K. Rapp et al. "Complementary MOS integrated binary counter," ISSCC'67 Digest of Technical Papers, pp. 52-53, 1967.
  F. Vittoz W. Hammer, M. Kiener and D. Chauwy, "Logical cirtion of the statement of the statement
- E. Vittoz, W. Hammer, M. Kiener and D. Chauvy, "Logical circuit for the wristwatch," Eurocon 1971, Lausanne, paper F2-6.
- 6 E. Vittoz, "LSI in watches," Solid-State Circuits 1976, Ed. Journal de Physique, 1977, Paris. Also published in Pulse, Jan. 1978, pp. 14-20 (Invited paper ESSCIRC'76).
  - E. Vittoz, C. Piguet and W. Hammer, "Model of the logic gate," Proc. Journées d'Electronique 1977, EPF-Lausanne, pp. 455-467.
- 8 E. Vittoz, B. Gerber, and F. Leuenberger, "Silicon-gate CMOS frequency divider for the electronic wrist watch," IEEE J. Solid-State Circuits, vol. SC-7, No2, pp. 100-104, April 1972.
- 9 C. Piguet, "Logic synthesis of race-free asynchronous CMOS circuits," IEEE Journal of Solid-State Circuits, vol. 26, No 3, March 1991, pp. 271-380.
- E. Vittoz and H. Oguey, "Complementary dynamic logic circuits," Electronics Letters, 15th Jan. 1973, vol.9, No 4.
- H. Oguey and E. Vittoz, "CODYMOS frequency dividers achieve low power consumption and high frequency," Electronics Letters, 23rd August 1973, vol. 9, No 17.
- 12 R. M. Swanson and J. D. Meindl, "Ion-implanted complementary MOS transistors in low-voltage circuits," IEEE Journal of Solid-State Circuits, vol. 7, pp. 146 - 153, April 1972.
- 13 E. Vittoz and J. Fellrath, "New analog CMOS IC's based on weak inversion operation," Proc. ESSCIRC '76, pp. 12-13, Toulouse, Sept. 1976.
- 14 E. Vittoz and J. Fellrath, "CMOS analog integrated circuits based on weak inversion operation," IEEE J. Solid-State Cir-

cuits, vol. SC-12, pp. 224-231, June 1977.

- E. Vittoz, M. Degrauwe and S. Bitz, "High-performance crystal oscillator circuits: theory and applications," IEEE J. Solid-State Circuits, vol. SC-23, pp. 774-783, June 1988
- 16 E. Vittoz, "Quartz oscillators for watches," invited paper, Proc. 10th International Congress of Chronometry, pp. 131-140, Geneva, 1979.
- H. Oguey and E. Vittoz, "Resistance-CMOS circuits," IEEE J. Solid-State Circuits, vol. SC-12, pp. 283-285, June 1977.
- 18 E. Vittoz and O. Neyroud, "A low-voltage CMOS bandgap reference," IEEE J. Solid-State Circuits, vol. SC-14, pp. 573-577, June 1979.
- 19 H. J. Oguey and B. Gerber, "MOS voltage reference based on polysilicon gate work function difference," IEEE Journal of Solid-State Circuits, vol. 15, pp. 264 - 269, June 1980.
- 20 G. Wegmann and E. A. Vittoz, "Analysis and improvements of accurate current mirrors," IEEE J. Solid-State Circuits, vol. 25, pp.699-706, June 1990.
- S. J. Daubert, D. Vallancourt and Y. P. Tsividis, "Current copier cell," Electron. Lett., vol. 24, pp. 1560–1562, Dec.8, 1988.
- 22 E. Vittoz, "Dynamic Analog Techniques," in Design of VLSI Circuits for Telecommunication and Signal Processing, Editors J. Franca and Y. Tsividis, Prentice Hall, 1994.
- 23 B. J. Hosticka, R. W. Brodersen, and P. R. Gray, "MOS sampled data recursive filters using switched capacitor integrators," IEEE Journal of Solid-State Circuits, vol. 12, pp. 600 -608, December 1977.
- 24 E. Vittoz, "Micropower switched-capacitor oscillator," IEEE J. Solid-State Circuits, vol. SC-14, pp. 622-624, June 1979.
- 25 R. Dessoulavy, A. Knob, F. Krummenacher and E. Vittoz, "A synchronous switched-capacitor filter," IEEE J. Solid-State Circuits, vol. SC-15, pp. 301-305, June 1980.
- 26 E. Vittoz and F. Krummenacher, "Micropower SC filters in Sigate CMOS technology," Proc. ECCTD'80, vol.1, pp.61-72, Warshaw, 1980.
- 27 E. Vittoz, "Microwatt switched capacitor circuit design," Summer Course on SC circuits, KUL, vol.II, Leuven, June 1981. Republished in Electrocomponent Science and Technology, vol.9, pp. 263-273, 1982.
- 28 F. Krummenacher, E. Vittoz and M. Degrauwe, "Class AB CMOS amplifier for micropower SC filters," Electronics Letters, 25th June 1981, vol.17, No 13, pp. 433-435.
- 29 M. Degrauwe, J. Rijmenants, E. Vittoz and H. De Man, "Adaptive biasing CMOS amplifiers," IEEE J. Solid-State Circuits, vol. SC-17, pp. 522-528, June 1982.
- 30 B. J. Sheu and C. M. Hu, "Switched induced error voltage on a switched capacitor," IEEE J. Solid-State Circuits, vol. SC-19, no. 4, pp. 519-525, Aug. 1984.
- 31 G. Wegmann, E. Vittoz and F. Rahali, "Charge injection in analog MOS switches," IEEE J. Solid-State Circuits, vol. SC-22, pp. 1091-1097, Dec. 1987.
- 32 F. Krummenacher, "Micropower switched capacitor biquadratic cell," IEEE J. Solid-State Circuits, vol. SC-17, pp. 507-512, June 1982.
- 33 F. Krummenacher and N. Joehl, "A 4-MHz CMOS continuous-time filter with on-chip automatic tuning," IEEE Journal of Solid-State Circuits, vol. 23, pp. 750 - 758, June 1988.
- 34 C. Enz, F. Krummenacher and E. Vittoz, "An analytical MOS transistor model valid in all regions of operation and dedicated to low-voltage and low-current applications," Analog Integrated Circuits and Signal Processing, Vol.8, pp.83-114, 1995.
- 35 A. S. Porret, T. Melly, D. Python, C. C. Enz and E. A. Vittoz, "An ultra low-power UHF transceiver integrated in a standard digital CMOS process: architecture and receiver," IEEE Journal of Solid-State Circuits, vol. 36 pp.462-466, March 2001.
- 36 T. Melly, A. S. Porret, C. C. Enz and E. A. Vittoz, "An ultralow-power UHF transceiver integrated in a standard digital CMOS process: transmitter," IEEE Journal of Solid-State Circuits, vol. 36 pp.467-472, March 2001.

- 37 D. Python and C. C. Enz, "A micropower class-AB CMOS log-domain filter for DECT applications," IEEE Journal of Solid-State Circuits, vol. 36, pp. 1067 - 1075, July 2001.
- 38 B. J. Hosticka, "Performance comparisons of analog and digital circuits," Proc. IEEE, vol. 73, pp. 25-29, Jan. 1985.
- 39 E. Vittoz, "Future trends of analog in the VLSI environment," invited session WEAM-9, ISCAS'90, New Orleans, May 2, 1990, digest pp. 1372-1375.
- 40 E. A. Vittoz and Y. P. Tsividis, "Frequency-Dynamic Range-Power," Ch. 10 of Trade-offs in Analog Design, ed. By C. Toumazou and G. Moschytz, Kluwer, 2002, ISBN 1-4020-7037-3.
- 41 X. Arreguit, F. A. van Schaik, F. V. Bauduin, M. Bidiville, and E. Raeber, "A CMOS motion detector system for pointing devices," IEEE Journal of Solid-State Circuits, vol. 31, pp. 1916 - 1921, December 1996.
- 42 M. Mahowald, "VLSI analogs of neuronal visual processing: a synthesis of form and function," Ph.D. dissertation, Computation and Neural Systems, California Institute of Technology, 1992.
- 43 A. Mortara, E. Vittoz and P. Venier, "A communication scheme for analog VLSI perceptive systems," IEEE J. Solid-State Circuits, vol. 30, pp. 660-669, June 1995.
- 44 P. Rüedi, P. Heim, F. Kaess, E. Grenet, F. Heitger, P. Burgi, S. Gyger, and P. Nussbaum, "A 128 \_ 128 pixel 120-dB dynamic-range vision-sensor chip for image contrast and orientation extraction," IEEE Journal of Solid-State Circuits, vol. 38, pp. 2325 2333, December 2003.
- O. Landolt,"Place Coding in Analog VLSI", a neuromorphic approach to computaton," Kluwer, 1998, ISBN 0-7923-8194-7.
- 46 E. Vittoz and X. Arreguit, "Linear networks based on transistors," Electronics Letters, Vol.29, pp.297-299, 4th Febr.1993.
- 47 K. Bult and G. Geelen, ISSCC Digest Tech. Papers, pp.198-199, 1992.
- 48 E. Vittoz, "Pseudo-resistive networks and their applications to analog collective computation," Proc. of MicroNeuro'97, Dresden, pp.163-173, ISBN 3-86005-190-3; also Proc. ICANN'97, Lausanne, pp.1133-1150, ISBN 3-540-63631-5, Springer Verlag, Berlin.
- 49 E. Vittoz, C. Enz and F. Krummenacher,"A basic property of MOS transistors and its circuit implications," Workshop on Compact Modelling, Nanotech 2003, WCM-MSM 2003, San Francisco, Febr. 23-27, Proc. Vol.2, pp.246-249.
- 50 C. C. Enz and E. A. Vittoz,"Charge-based MOS Transistor Modeling: The EKV model for low-power and RF IC design," John Wiley & Sons Inc., 2006, ISBN-13:978-0-470-85541-6.
- 51 G. Gildenblat, X. Li, H. Wang, W. Wu, R. van Langevelde, A.J. Scholten, G.D.J. Smit and D.B.M. Klaassen, "Introduction to PSP MOSFET Model," MSM-Nanotech, 2005.
- 52 http://legwww.epfl.ch/ekv/.
- 53 E. Vittoz, "Analog circuits in weak inversion," chapter 8 of A. Wang et al., Sub-Threshold Voltage Circuit design For Ultra Low Power Systems, Springer, 2006.
- 54 Z. T. Deniz, Y. Leblebici, and E. Vittoz, "On-line global energy optimization in multi-core systems using principles of analog computation," IEEE Journal of Solid-State Circuits, vol. 42, pp.1593-1606, July 2007.
- 55 Y. Ji-Ren, I. Karlsson, and C. Svensson, "A true single-phaseclock dynamic CMOS circuit technique," IEEE Journal of Solid-State Circuits, vol. 22, pp. 899 - 901, October 1987.
- 56 E. Vittoz, "Low-power design: ways to approach the limits," Proc. ISSCC'94, pp. 14-18, San Francisco, February 16, 1994.
- 57 E. Vittoz, "Weak Inversion for Ultimate Low-Power Logic," Chapter 16 of Low-Power Electronic Design, Editor, C. Piguet, CRC Press LLC, 2004.
- 58 A. Wang, B. H. Calhoun and A. P. Chandrakasan, Sub-Threshold Voltage Circuit design For Ultra Low Power Systems, Springer, 2006.
- 59 E. Vittoz, "MOS transistors operated in the lateral bipolar mode and their applications in CMOS technology," IEEE J. Solid-State Circuits, vol. SC-18, pp. 273-279, June 1983.