Spaced Omnis & the Curtain of Sound
If directional information can be represented by interchannel time delays, then would it be possible to abandon Blumlein's "amplitude for phase difference" substitution outlined last month and concentrate on the time delay information? About the time that Blumlein was experimenting with his crossed figure-of-eights in 1931, the Bell Telephone Labs in the USA derived from experimental binaural work the "curtain of sound" concept (fig.7). The overall wavefront emanating from the orchestra is sampled at a large number of coplanar points. The signals from these microphones feed loudspeakers in the equivalent positions so that the wavefront is effectively recreated. Obviously, with an infinite number of microphone/speaker combinations the reproduction should be perfect, but the Bell Labs work indicated that only three might well be sufficient. A famous experiment was conducted in 1933 whereby the Philadelphia Orchestra performed in Philadelphia, three microphones fed three specially corrected landlines, and Leopold Stokowski balanced the three output channels in Constitution Hall, Washington.
However, what information is actually being recorded? Consider the simplest spaced microphone technique in fig.8: two omnidirectional microphones are placed in front of the orchestra. A central sound-source produces an equal amplitude signal in each mike—we have our familiar double-mono central image. The wavefront from that sound-source also reaches both mikes simultaneously so there is no time delay introduced.
Consider, though, a sound-source at the far left. It is very much nearer mike A than mike B, so there will be an amplitude difference between the two mike outputs. Unfortunately, as the ratio of amplitudes is derived from a ratio of distances, the inverse-square law dictates that there will not be a linear relation between the amplitude ratio and direction between far left/right and centre à la Blumlein, but a square-law relation. Central sources are reproduced as a central image, but a sound-source slightly to one side of centre is reproduced as an image considerably further away from the centre.
But what of the time delay also introduced because of the different distances between the source and the two mikes? This delay introduces a precedence-effect favouring the nearer mike which will further exaggerate the imaging. Because the two mikes generally have a large spacing when compared with the half-wavelength of sound over most of the audio band, many writers have said that the time delay will produce random phase differences between the two channels, so two spaced omnis will produce amplitude-only information. But this will only be true for steady-state signals where each frequency will have a different phase relationship between the channels, resulting in a cyclic "smearing" of the source position with frequency.
For transients and the starts of notes, the precedence-effect time delay operates just as it does for loudspeakers, and it further exaggerates the pulling of images away from the center. If two omnis are used to record a man going from left to right, for instance, his recorded image will slowly move away from the left speaker, with the amount of reverberation increasing, and then move very quickly over to the right with the reverberation decreasing to its original level again.
Thus with recordings made with just two omni mikes, there will be a strong tendency for images to "clump" around the loudspeakers, coupled with an overall vague swimmy quality, with instruments appearing to change position with frequency. In an attempt to firm up the central image, the proponents of this technique nearly always add a third central mike (fig.9), as used in that Bell relay, and this is reasonably effective in reducing the image anomalies so that recorded instruments appear to come from approximately their original positions. However, each pair of mikes—LR, LC, CR—will produce an image of each sound-source in a slightly different place, the only agreement being at the extreme L/R and centre.
And even then, positioning will only hold when the width of the overall source is similar to the microphone spacing. Some time ago I heard a digital master tape of a tramcar, recorded with two omnis spaced around 15' apart, parallel to the rails. When the tramcar was a very long way away stage-left, because this distance is very large compared to the mike spacing, the image was, in fact, central and distant. As the tram approached from the left, the mike spacing became significant first at high frequencies and then on down the audio range. The image smeared towards the left, high frequencies first, until it was localised in the left-hand speaker. As the tramcar reached and then passed the left-hand mike, the recorded image suddenly moved away very quickly to the right-hand speaker, and as it receded, stage-right, first the low frequencies and then the high, smeared back into the centre. Similar recordings made with crossed figure-of-eight mikes show no such anomalies.
Stereo apart, however, there are advantages to the spaced technique, which are why it has persisted in use, particularly in the USA where a generation of recording engineers cut their audio teeth on the Bell work. If the use of figure-of-eight mikes is ruled out, then crossed coincident techniques mean the use of cardioid or hypercardioid mikes. Now the basic response of any mike can only be one of two types: velocity sensing (figure-of-eight), and pressure sensing (omnidirectional). A particular directional pattern, such as cardioid, has to be derived from one or both of these two, and, as anyone who read Angus Mckenzie's recent microphone reviews in HFN/RR (April/May 1981) will appreciate, the derivation of that pattern can introduce significant coloration and severely degrade the overall frequency response. Professional microphones will naturally be much less colored than their amateur cousins, but they generally still add enough coloration to the signal to be identified aurally. It is much easier to design an omnidirectional mike to have an intrinsically flat response within frequency limits set by various mechanical parameters, and recordings made with spaced omnis can have a refreshingly clean sound with translucent high frequencies and a smooth extended bottom end.
The Crown "PZM" microphone places an omni capsule very close to a surface so that the incident and reflected waves reinforce in phase, giving an extremely well maintained hemispherical response, down to a frequency determined by the size of the adjacent surface. The use of a spaced pair of PZMs on the floor, therefore, gives a relatively flat uncoloured sound with well extended bass and, if the anomalous stereo can be accepted, has the enormous advantage, particularly for live opera recording, in that there are no unsightly microphones, or mike stands in view.
The Mid-Side Technique
There is one remaining coincident technique which gives accurate lateral imaging, this being the MS technique described in full by Tony Faulkner in HFN/RR (August 1980). Briefly, a forward-facing cardioid or hypercardioid (Middle) is coincident with a sideways-facing figure-of-eight mike (Side). The sum and difference of the two mikes' outputs give, respectively, the left and right channels, and there are two main advantages: central sources are exactly on-axis for the M mike, so central images are strong and that mike is nearest to giving its flattest response, while the overall stage width can be adjusted by altering the relationship between the two mike outputs. Summing Up
Fig.10 shows diagrammatically the stages presented by the techniques discussed so far. A fascinating article by Bruce Bartlett, an engineer at Shure, examining the subjective stereo results of the various "purist" mike techniques, appeared in various "purist" make techniques, appeared in the December 1979 issue of the American Studio Magazine db; it is well worth a read.
So far, I have only discussed the techniques which aim to capture the stereo information as it would have been perceived at the original event. Let's go back a couple of thousand words: "perfect" imaging can be obtained by panpotting mono signals to suitable positions across the lateral stage. Now every "purist" technique examined has problems—either theoretical or practical, and concerned with the defects of real microphones which don't oblige the mathematician by conforming at all frequencies with their ideal response pattern.
In particular, use of a coincident technique, with its capture of the acoustic in which the musicians are performing, necessarily implies that the acoustic should be both suitable for the kind of music and attractive in its own right. This is rarely the case and, although ideally recordings should only be made in one of the apparently small number of good venues, commercial realities mean that the convenient location of a hall and its facilities often outweigh its total lack of a good acoustic. Walthamstow Town Hall, and All Saints, Tooting, for instance, are often used, yet the excessive wash of reverberation in such places—when empty of an audience-makes the live orchestral sound strange indeed, and not a sound that one would particularly want to record at all. The conductor also has a problem in hearing all of the orchestra!
Peter Fellgett drew a rather nice analogy—if the racecourse is flooded, then running the race with the horses having floats on their hooves might be possible, but it is not really desirable! This, of course, does disregard the commercial aspects which led to the recording happening in the first place, so the engineers often really have no choice but to make the best of an aurally bad job.
If the acoustic is not suitable, then it must be suppressed: using a separate mike for each group of instruments and moving the mikes closer to the instruments will minimise the reverberation picked up, each mike output then being panned in a conventional pop music manner to its position in the stage. Unfortunately, as any instrument is intended to produce its best tone at audience distance, its close sound is generally harsh and overbright, so the use of close mikes means that some treble roll-off has to be used, particularly with instruments such as violins where this degradation is severe. The lack of reverberation means that some degree of artificial ambience has to be introduced, either from an echo-plate or from a distant mike in the hall. Again, this ambience which, unlike that on a "purist" record, carries zero positional information, has to be panpotted to a suitable place in the stage. Also, as the multi-miking has totally destroyed the correct internal balance achieved by the orchestra, this must be restored by adjusting the amplitude of each mike's output with a mixer.
Unfortunately, every mike, as well as picking up its desired instrument, will pick up surrounding instruments, and major alterations to the "raw" balance can prove impossible—bringing up the level of the second violins, for instance, might well bring up that of the horns to an unacceptable degree. Even when the miking is arranged for minimum spill-over, when one instrument has a climax, the pick-up of its sound by remote mikes can become disturbing as its image will then smear and widen. Using a 'scope, this can be seen dramatically with some recordings. As the central solo instruments increases in volume, there is a change from an almost pure lateral cut pattern to a circular pattern as the random phase contributions from other mikes become significant.
As these other mikes are also at different distances, the contribution from each will arrive at a different time from that of the main signal. This, it has been postulated ("Editorial" HFN/RR Feb. 1979), will create a pattern of "echoes," giving an incoherent spurious ambience; but perhaps more importantly, there will be cancellations at frequencies where the distance between any pair of mikes is equal to an odd number of half-wavelengths. The "comb-filter" coloration that this introduces can be severe, yet is impossible to remove with equalisers or tone controls.
With a good team of engineer/producer/conductor, and checking phase relationships between pairs of mikes using a scope, the aural results of a technique in which each solving of a problem appears to create at least two more, can be very impressive. Indeed, probably the majority of records appearing in HFN/RR's "Quality Monitor" column are made this way, and they can be very satisfying.
However, if the stereo image produced by a crossed-pair of ideal figure-of-eight mikes is a "true" two-dimensional image, in that that image accurately reproduces the lateral and depth information that you would have perceived had you been at the recording, the multi-mike technique can only produce a montage of individual mono images (fig.11). With the advent of cheap domestic digital playback systems in the near future, the "joins" in that montage will be all the more apparent, and thus at last the consumer will be able to put real pressure on the record companies. For too long, production teams have been able to sit secure in the knowledge that their necessary—by their lights—compromises can pass undetected by all except the most critical of enthusiasts.
In the same way that we have two related but different art forms in the theatre and in cinema, maybe we will see the appearance of two separate types of recording, as postulated by Columbia/CBS classical producer Paul Myers (HFN/RR Aug. 1978). We can either have a "snapshot" of a concert, or a recording with full utilisation of techniques to produce, deliberately, a new work of art, much as recorded opera apparently is more successful as an aural event, the less relation it bears to the staged version with its limited theatrical effects. One can use real cannons and church bells in Tosca, for instance, or thunder in The Magic Flute, or just about anything in The Ring. One thing is for certain; it should prove interesting to follow the record review pages in HFN/RR throughout the 1980s.
If directional information can be represented by interchannel time delays, then would it be possible to abandon Blumlein's "amplitude for phase difference" substitution outlined last month and concentrate on the time delay information? About the time that Blumlein was experimenting with his crossed figure-of-eights in 1931, the Bell Telephone Labs in the USA derived from experimental binaural work the "curtain of sound" concept (fig.7). The overall wavefront emanating from the orchestra is sampled at a large number of coplanar points. The signals from these microphones feed loudspeakers in the equivalent positions so that the wavefront is effectively recreated. Obviously, with an infinite number of microphone/speaker combinations the reproduction should be perfect, but the Bell Labs work indicated that only three might well be sufficient. A famous experiment was conducted in 1933 whereby the Philadelphia Orchestra performed in Philadelphia, three microphones fed three specially corrected landlines, and Leopold Stokowski balanced the three output channels in Constitution Hall, Washington.
However, what information is actually being recorded? Consider the simplest spaced microphone technique in fig.8: two omnidirectional microphones are placed in front of the orchestra. A central sound-source produces an equal amplitude signal in each mike—we have our familiar double-mono central image. The wavefront from that sound-source also reaches both mikes simultaneously so there is no time delay introduced.
Consider, though, a sound-source at the far left. It is very much nearer mike A than mike B, so there will be an amplitude difference between the two mike outputs. Unfortunately, as the ratio of amplitudes is derived from a ratio of distances, the inverse-square law dictates that there will not be a linear relation between the amplitude ratio and direction between far left/right and centre à la Blumlein, but a square-law relation. Central sources are reproduced as a central image, but a sound-source slightly to one side of centre is reproduced as an image considerably further away from the centre.
And even then, positioning will only hold when the width of the overall source is similar to the microphone spacing. Some time ago I heard a digital master tape of a tramcar, recorded with two omnis spaced around 15' apart, parallel to the rails. When the tramcar was a very long way away stage-left, because this distance is very large compared to the mike spacing, the image was, in fact, central and distant. As the tram approached from the left, the mike spacing became significant first at high frequencies and then on down the audio range. The image smeared towards the left, high frequencies first, until it was localised in the left-hand speaker. As the tramcar reached and then passed the left-hand mike, the recorded image suddenly moved away very quickly to the right-hand speaker, and as it receded, stage-right, first the low frequencies and then the high, smeared back into the centre. Similar recordings made with crossed figure-of-eight mikes show no such anomalies.
Stereo apart, however, there are advantages to the spaced technique, which are why it has persisted in use, particularly in the USA where a generation of recording engineers cut their audio teeth on the Bell work. If the use of figure-of-eight mikes is ruled out, then crossed coincident techniques mean the use of cardioid or hypercardioid mikes. Now the basic response of any mike can only be one of two types: velocity sensing (figure-of-eight), and pressure sensing (omnidirectional). A particular directional pattern, such as cardioid, has to be derived from one or both of these two, and, as anyone who read Angus Mckenzie's recent microphone reviews in HFN/RR (April/May 1981) will appreciate, the derivation of that pattern can introduce significant coloration and severely degrade the overall frequency response. Professional microphones will naturally be much less colored than their amateur cousins, but they generally still add enough coloration to the signal to be identified aurally. It is much easier to design an omnidirectional mike to have an intrinsically flat response within frequency limits set by various mechanical parameters, and recordings made with spaced omnis can have a refreshingly clean sound with translucent high frequencies and a smooth extended bottom end.
The Crown "PZM" microphone places an omni capsule very close to a surface so that the incident and reflected waves reinforce in phase, giving an extremely well maintained hemispherical response, down to a frequency determined by the size of the adjacent surface. The use of a spaced pair of PZMs on the floor, therefore, gives a relatively flat uncoloured sound with well extended bass and, if the anomalous stereo can be accepted, has the enormous advantage, particularly for live opera recording, in that there are no unsightly microphones, or mike stands in view.
There is one remaining coincident technique which gives accurate lateral imaging, this being the MS technique described in full by Tony Faulkner in HFN/RR (August 1980). Briefly, a forward-facing cardioid or hypercardioid (Middle) is coincident with a sideways-facing figure-of-eight mike (Side). The sum and difference of the two mikes' outputs give, respectively, the left and right channels, and there are two main advantages: central sources are exactly on-axis for the M mike, so central images are strong and that mike is nearest to giving its flattest response, while the overall stage width can be adjusted by altering the relationship between the two mike outputs. Summing Up
Fig.10 shows diagrammatically the stages presented by the techniques discussed so far. A fascinating article by Bruce Bartlett, an engineer at Shure, examining the subjective stereo results of the various "purist" mike techniques, appeared in various "purist" make techniques, appeared in the December 1979 issue of the American Studio Magazine db; it is well worth a read.
So far, I have only discussed the techniques which aim to capture the stereo information as it would have been perceived at the original event. Let's go back a couple of thousand words: "perfect" imaging can be obtained by panpotting mono signals to suitable positions across the lateral stage. Now every "purist" technique examined has problems—either theoretical or practical, and concerned with the defects of real microphones which don't oblige the mathematician by conforming at all frequencies with their ideal response pattern.
In the same way that we have two related but different art forms in the theatre and in cinema, maybe we will see the appearance of two separate types of recording, as postulated by Columbia/CBS classical producer Paul Myers (HFN/RR Aug. 1978). We can either have a "snapshot" of a concert, or a recording with full utilisation of techniques to produce, deliberately, a new work of art, much as recorded opera apparently is more successful as an aural event, the less relation it bears to the staged version with its limited theatrical effects. One can use real cannons and church bells in Tosca, for instance, or thunder in The Magic Flute, or just about anything in The Ring. One thing is for certain; it should prove interesting to follow the record review pages in HFN/RR throughout the 1980s.















