I consider that I understand an equation |
when I can predict the properties of its solutions, |
without actually solving it. |
– Paul Dirac |
Let’s talk about the process that can be used to acquire an understanding of the type Dirac was talking about. Let’s investigate an example, namely the following expression:
| (1) |
It must be emphasized that we care mostly about the process that will be used. We care more about the process than we care about this particular example. Expression 1 is moderately useful, but it is not nearly so useful as the general-purpose process of figuring things out.
For present purposes, we restrict attention to the case where x and y are non-negative real numbers, which is appropriate for ordinary resistors. (The case of negative or complex x and y is also interesting, and is relevant for capacitors and inductors, but let’s leave that for another day.)
| (2) |
which defines InvAd as a function of two variables.
It may be that most of your experience has dealt with functions of one variable, such as sine, cosine, square root, et cetera. Now we have a function of two variables, which is somewhat harder to visualize ... but not unreasonably so, as we shall see.
Then, the next time you encounter it, think about it some more. Figure out some more of the properties. Figure out some more of the connections.
We call this the spiral approach. You begin by learning some bare facts. Then you spiral back and use principles to connect the dots. Then you learn some new facts and spiral back again to connect them to all the previously known facts and principles. And so on.
Here is a scenario that may not apply to you personally, but illustrates the sort of thing I am talking about.
You may think this section presents more than enough details about the InvAdd function, but actually I have left out more than half of the story. There’s a lot more you can figure out if you want. See e.g. section 3.
You can’t give something a clever name until after you know what it does. Therefore the first time you encounter something, give it a nondescript temporary name, such as F. After you figure out what it does, you can go back and rename it.
| (3) |
The RHS of equation 4b would be ill-behaved if we applied it when x=y=0, but we have nothing to lose by simply defining InvAdd(0,0) to be zero. This is the only reasonable answer, as you can see by setting x equal to y and then taking the limit as they both go to zero.
Equation 4 is equivalent to equation 2 whenever xy is nonzero ... and since equation 4 is well-behaved at zero and equation 2 is not, we have nothing to lose by studying the InvAdd function instead of the InvAd function. So let’s do that.
We can turn InvAdd into a function of one variable by holding the other variable constant. For example, we can evaluate: | In contrast: |
|
|
This makes sense in terms of the physics of a resistor in parallel with a dead short. | This makes sense in terms of the physics of a resistor in series with an open circuit. |
Here is another way of turning InvAdd into a function of a single variable. You can show that: | In contrast: |
|
|
That tells us that an infinite resistance is the identity element for the InvAdd operator (resistors in parallel). | A zero resistance is the identity element for the addition operator (resistors in series). |
Note that strictly speaking infinity is not a number and in general cannot safely be treated as a number ... but in this narrow context it is OK. A thin lens with infinite focal length is a well-known and harmless concept. Also, infinite resistance is a well-known concept. It can sometimes get you into trouble, e.g. if you hook a constant-current source to an infinite impedance load ... but that is no worse than hooking a constant-voltage source to a zero-impedance load.
Here’s yet another way of turning our function into a function of a single variable: We can use the same variable twice. That gives us: | In contrast: |
In other words, InvAdd(x, x) is a straight line with slope 1/2. This has an obvious physical significance in terms of identical resistors in parallel. | In other words, sum(x, x) is a straight line with slope 2. This has an obvious physical significance in terms of identical resistors in series. |
so we already understand a few things about this function.
In this case, we discover that when evaluating InvAdd(x, y), the two input variables need to have the same dimensions; otherwise the denominator in the definition (equation 4) doesn’t make sense.
We also discover that the result returned by InvAdd(x, y) has the same dimensions as x and y. So in this way, InvAdd has something in common with the addition operator: InvAdd(x, y) has the same dimensions as sum(x, y).
The recall process depends on using connections to fish up ideas when they are needed. Therefore the learning process depends on forming appropriate connections. This takes time and effort. Whenever you are exposed to a new idea, you need to mull it over. You need to ponder it, looking for ways in which it is similar – or dissimimlar – to previously-known ideas.
So, in item 12 as well as in item 11 establishing a connection between the InvAdd function and the sum function greatly strengthens our understanding of the InvAdd function ... and even strengthens our understanding of the sum function.
| (11) |
Again: You are allowed to define your own notation.
Learning is not a routine, mechanical process. It is a creative process. See reference 5.
|
|
All of these make sense in terms of the physics of resistors in parallel. | All of these make sense in terms of the physics of resistors in series. |
Note that there is no such thing as “the” distributive property, because there are lots of different things that might or might not distribute over other things. Examples include:
See also item 29 for a discussion of a generalized distributive law.
A remark about the process: Again we are making connections, connecting the algebraic properties of the ∥ operator to the physical properties of electronic circuits ... and also to the algebraic properties of the + operator. Obviously ∥ and + are not exactly identical. They are in some ways the same, and in other ways the opposite.
You can easily verify that if we hold y constant, then as x approaches zero,
Similarly, as x becomes large
We can summarize both equation 14 and equation 16 by saying that when we have N disparate resistors in parallel, the parallel combination is dominated by the smallest resistor in the bunch. | We can summarize both equation 15 and equation 17 by saying that a series combination is dominated by the largest resistor in the bunch. |
Note that in equation 17, we have convergence in log/log space. The sum itself, without the logs, does not actually converge ... but usually people care more about the ratio, as in equation 18, and log/log convergence is an appropriate way to quantify this. |
You can verify the asymptotic behavior as described in the previous item.
For example, you might wonder whether 1/(1/x + 1/y) could sometimes be approximated by x + y, which would make things nice and simple.
Now it turns out that this is a perfectly terrible approximation. There are many ways that you can tell that x∥y is not “usually” equal to x+y, for instance by looking at item 9 or item 10 or item 11 or item 17 or item 18.
Indeed you can easily prove that x∥y is never equal to x+y for any real-valued x and y, except in the trivial case when x=y=0 or when x=y=∞.
However, as a matter of principle, if you want to be scientific you have to check all the plausible hypotheses. Some of the hypotheses will check out, and some of them won’t. Accepting a hypothesis without checking is just as bad as rejecting a hypothesis without checking.
As a case in point: The topic we are discussing today originally arrived at my desk in the following form: «How do you convince a student in a lasting manner that the reciprocal of (1/x + 1/y) is not, in general, x+y?»
That is a perfectly reasonable question as far as it goes, and is worth answering (see item 19) ... but it is only the tip of the iceberg. Any student who understood expression 1 wouldn’t ever think it was equal to x+y. So we should take an indirect approach: First understand the expression, and then use that understanding to answer the original question.
For this reason, the information we have given about resistors in parallel is of very limited importance. You are likely to forget it, and that’s OK. In contrast, the process of figuring things out is of unlimited importance, because you get to use it again and again. That is something you can and should use every day, which means you will never forget it.
If some day you find that you need to know about resistors in parallel, for instance if you are designing some fancy electronic filter network, you can spend the first five minutes of the day re-learning everything you need to know.
The next day, you will probably need to understand some other formula. As long as you remember the process involved in figuring out such things, all will be well.
The next time you encounter an unfamiliar expression:
From now on, do not wait for somebody to tell you to do these things. Do them routinely, automatically. Do not expect the textbooks to tell you to do these things.
Consider the contrast:
Learning to understand what the formula means takes a modest amount of time and effort, and produces a useful result | Rote memorization of the mere form of the formula also takes time and effort, and produces an incomparably less useful result. The memory won’t be connected to anything relevant, so you won’t be able to recall it when needed. |
You may ask why anybody would bother to figure out so many ways of looking at something as simple and unimportant as expression 1. There are three answers:
Here is another example of defining something and exploring what it means.
To set the stage, we define the “reciprocal” operator, i.e. the multiplicative inverse, via:
| (19) |
This is a nice simple function of a single variable.
| (20) |
Then similarly, we can define unary / in terms of binary / and the appropriate identity element:
| (21) |
|
Therefore the fact that x∥y is not equal to x+y (as discussed in item 19) can be considered a rather direct consequence of the fact that the unary Recip operator does not distribute over addition (equation 22b) with help from the self-inverse property (item 23).
which has a simple physical interpretation in terms of adding the conductances for circuit elements in parallel. (The conductance is defined to be the reciprocal of the resistance.)
| (25) |
Here we are using the & operator to represent the InvAdd function. The reasons for this are discussed in section 5.
That gives us another connection. Another symmetry.
In all cases, resistors in parallel lowers the resistance, whereas resistors in series raises the resistance. | In all cases, conductances in parallel raises the conductance, whereas conductances in series lowers the conductance. |
Beware that these rules cannot be extended to AC circuits involving inductors and capacitors, since in that case the impedances can be complex numbers.
where in this case the variables are Boolean, and −, ·, and + denote BooleanNot, BooleanAnd, and BooleanOr (respectively).
Equation 27 expresses De Morgan’s theorem again, using a slightly different notation, using an overbar instead of a − sign to denote the BooleanNot operation:
In going from equation 27c to equation 27d, notice what happens to the operator on the RHS: it goes from (+) to (·). This is a rather abstract idea. We are treating the operator as an object unto itself and saying what happens to the operator ... which is quite a separate question from what happens to the input variables x and y. Note the contrast:
Ordinary Distributive Law | Generalized Distributive Law |
When we distribute multiplication over addition, as in k(x + y), the factor of k acts on the variable x and acts on the variable y, but leaves the operator + unchanged. It is characteristic of the ordinary distributive law that the operator remains unchanged. | When we distribute BooleanNot over BooleanOr, as in x + y, the negation acts on the variable x, acts on the variable y, and also acts on the operator + in accordance with equation 28b. It is characteristic of the generalized distributive law that the operator is modified. |
De Morgan’s theorem can be shorthanded by saying that unary − distributes over · and + in such a way that
These shorthand expressions must be interpreted in the context of the generalized distributive law: Starting from the Boolean expression −(A+B), when we distribute the minus sign, it negates the A, then converts the + into a · in accordance with equation 28b, and then negates the B.
Returning to the resistor law, the corresponding statement is that unary / distributes over + and ∥ in such a way that
Most real-world electrical engineers are quite aware of the algebraic structure of these operators. Indeed, logic circuits are very commmonly implemented using switches in series and switches in parallel.
| (30) |
We define focusing strength (also known as optical power) to be the inverse focal length. The SI unit for focusing strength is the inverse meter, which is also known as a diopter.
So the rule is, for a combination of thin lenses in close proximity, we just add the focusing strengths:
| (31) |
Here we are using the unary reciprocal operator “/” as defined in item 22.
The analogy to electrical resistors looks like this:
| (32) |
Specifically, for lenses in series, we add the focusing strengths, just as for resistors in series, we add the resistances. The analogy is not perfect, because there is no such thing as lenses in parallel.
In particular, if you write down a chart and there is something missing, it means you should look around some more, and it tells you where to look. For example, if you had never heard of the concept of optical power (aka focusing strength), your chart would look like this:
| (33) |
This tells you that you should ask around. Ask if anybody knows the official name for “inverse focal length”. The concept is so obviously useful that you can bet somebody has already named it and studied it ... and if not, you should invent your own name for it.
In connection with resistors, it is traditional and reasonable to define the “parallel” operator via:
| (34) |
For conductances in series, this would not make much sense. Therefore in
Similarly for lenses, the ∥ notation would not make sense, because the lenses are arranged in series along the optical path, not in parallel.
Note the contrast:
When the ∥ symbol is applied to resistances, form follows function, in that x∥y is the correct expression for resistors in parallel. | When the ∥ symbol is applied to conductances, the form is diametrically contrary to the function, since x∥y is the correct expression for conductances in series (not parallel). |
The ∥ symbol does double duty: It tells us to perform the InvAdd mathematical operation, and it also explains why, namely resistors in parallel. | For thin lenses or for capacitances in series, it is better to use the & symbol. This specifies the same mathematical operation, without suggesting an incorrect explanation. |
If you ever run into a situation where form does not follow function, choose a different symbol. For resistance in parallel, the ∥ symbol is a good choice. For other things, such as thin lenses or capacitances in series, & is a better choice.