10/18/2023

Does DSD Sound Better Than PCM? (A Deep Dive)

 


DSD (direct stream digital) is almost like a mythical creature that only certain audiophiles have confronted before.

These are people who come back from their adventures with DSD telling wild stories about how they heard this extra-terrestrial beast murmur a sound that nobody has ever noticed before (besides them).

If you’ve heard of DSD, you may be familiar with these stories. In this article, we’ll uncover if they are true or false; and we’ll separate reality from fiction. 

But First, Some Primer

It may sound obvious, but most mythical stories are not real. If someone were to say that DSD sounds better than PCM, they could be right.

However, the difference between the two is splitting hairs when it comes down to sound quality.

The drawbacks with DSD are high, and the commercial viability of DSD is non-existent compared to PCM.

Before we look at why this is, we need to make sure we understand some of the fundamentals of digital audio first…

What Are Samples?

The common answer to this is usually something like “samples are pieces of music used in other pieces of music.”

While this is correct, in this context a sample is something much smaller than a piece of music.

A sample is the smallest piece of a waveform possible, describing the wave’s exact position at a precise moment in time.

When all these samples are put together, you get a sound wave. You can think of samples as like pixels but for audio.

In some DAWs and wave editors, you can zoom right in and see little dots on the waveform.

These are samples and you may even be able to move them around, though it’s very hard to edit audio this way unless you are removing a few peaks here and there.

What Is Bit Depth?

Audio is stored in computers with bits – 1s and 0s. So if you have some binary code such as 1001010101000110, since it has 16 digits, this means it has 16 bits.

So bit depth is simply a way to store more information per sample in an audio file in x bits. 

  • 16-bit means there are 65,536 different possible values per sample
  • 24-bit offers more than 16 million possible values per sample
  • 32-bit has more than 2.1 billion unique values

But it should be noted that 32-bit float is technically 24-bit with an 8-bit mantissa, and there are no major practical benefits to using 32-bit audio over 24.

What Is Sample Rate?

The sample rate is the number of samples in one second of audio. So if you have a 48 kHz sample rate, then 48,000 samples of the audio are measured per second to recreate the sound.

So, if you are digitally recreating a sine wave at 48 kHz, then you are measuring the amplitude of the sine wave at 48,000 different points per second.

The most common sample rates are 44.1 kHz and 48 kHz, which are used in most DAWs.

What Sample Rate Should You Use For Music?

What sample rate you should use will depend on the audio quality you prefer, but the differences are very hard to notice. 48 kHz is the safest option but most consumers can not tell the difference between 48 kHz and 44.1 Khz.

In this case, we look to other reasons to pick between 48 kHz and 44.1 kHz. Simply put, 44.1 kHz is a leftover from the CD era, and though it sounds perfectly fine, 48 kHz is more compatible with video formats.

From an engineering perspective, calculations involving 48 samples per millisecond are neater than 44.1 samples per millisecond.

If you want to use Max, Reaktor, or make your own VST plugins, this is worth considering.

What is PCM?

PCM stands for pulse-code modulation and is the standard digital audio format used to encode sound waveforms digitally.

With PCM, the amplitude of an audio signal is sampled at regular intervals, creating a waveform.

History

In 1937, Alex Reeves conceived of PCM for voice communication in Britain while working for a phone company in France.

He filed a patent in 1938, describing it in theory with its advantages, but there was no practical application that resulted.

PCM technology was mainly used as a better approach to send telephone calls in the early 1960s, but the quality was not good enough for high-fidelity music until the 80s.

It was finally brought into the consumer market in 1982 when the Compact Disc (CD) brought PCM to the mainstream market and took off for its convenience and fidelity.

What is DSD?

DSD is a niche, 1-bit audiophile digital audio format used to represent analog samples digitally in a different way than PCM.

Instead of sampling the amplitude of audio signals at regular intervals, each bit is an interval that changes depending on the amplitude of the signal.

Does DSD Have a Higher Sample Rate Than PCM?

Yes, DSD (direct stream digital) has a much higher sample rate of over 1000kHz. In comparison, PCM is between 44.1kHz-192kHz.

Does DSD Sound Worse Than PCM?

Not necessarily. The problem with DSD is that you cannot edit, mix, or master it because there’s no proprietary software to do any of these things.

DSD also uses extremely high levels of noise shaping, meaning that at lower frequencies (those we can hear) the noise is very low, but once you get into ultrasonic frequencies the noise level rises extremely quickly.

What Is A 1-Bit Format?

A 1-bit format is about as simple as it gets. Whereas, a 16-bit format has 16 different bits that could either be on or off at the same time, and the combined bits add more complexity.

History of DSD

In the early 1990s, Sony collaborated with Phillips who had begun developing the DSD technology, and together the two made the SACD format, which is the same as DSD but in a consumer format (like PCM is to CD).

However, while the two had finalized the development of the product, the industry had made a giant step forward.

DAC manufacturers switched from 64fs to 128fs, which is twice the sampling rate, as well as a 5-bit format rather than the 1-bit format that they previously created.

So, they were essentially faced with the dilemma of filming a black-and-white film right when color television was invented.

However, DSD didn’t pan out to be commercially viable enough to be used as a mainstream source of digital audio encoding.

PCM became the dominant analog-to-digital encoding format in the early 80s when CDs were invented.

What Are Some Drawbacks of Using DSD?

The problem with DSD is that it is noisier and offers a more limited frequency range than comparable 24-bit PCM of a sampling rate >88.2kHz.

As a 1-bit format, there isn’t enough space in DSD for it to be dithered properly. As a result, you end up with an elevated noise floor.

The same issue applies to the DAC designs that originally inspired the format.

Which Sounds Better? DSD or PCM?

A lot of controversies have been made over which encoding system sounds better, but one can never really know unless they hear for themselves.

Not really easy when almost every digital device uses PCM audio-only.

Furthermore, many argue that DSD is not suitable for high-end applications and high-resolution audio because of its high distortion, but there are still audiophiles who swear that DSD DEFINITELY sounds better.

How Do I Listen To DSD?

You can listen to DSD by using an external Digital to Analog Converter, or DAC. A DAC can handle the high sample rates that are used in DSD format and can be connected to your computer through USB.

You will also need software to be able to listen to DSD as well. Programs like HQPlayer and JRiver work for both Mac and Windows, while Audirvana works for Mac exclusively, and Teac HR Audio Player works for Windows exclusively.

Can You Do Post-Production With DSD?

There’s been no method to edit, mix, and master DSD files like you could PCM files.

Therefore, most “commercially available” DSD recordings are recorded directly to DSD with no mixing/mastering, or are simply converted to and from PCM!

There are a few new software packages that can mix, master, and edit using DSD, but most are from very small niche companies.

Why Does DSD Need To Be Converted To PCM?

PCM does not have noise in the higher frequencies like DSD so it allows non-linear processing effects such as saturation and distortion.

DSD also cannot be dithered like PCM audio, because dithering applies randomness, the 0s and 1s become meaningless and the result is simply noise.

What Is Dithering?

Dither works by adding a bit of random noise to the signal: the sample values are shifted around a tiny bit in an unpredictable way.

This gets rid of the nasty distortion that results when decreasing the bit depth from 24 to 16 or lower.

Even though this raises the noise floor, the result is still less harsh than without dithering.

Dithering is somewhat antiquated and should only be used in the final stage of the production process in mastering when preparing a 16 bit master. Do not apply dithering to anything else!

What Is Quantization?

With PCM, the amplitude of the signal is limited to just one of a set of fixed values, determined by bit depth. This limiting process is called quantization.

Dither is needed on the signal to avoid quantization distortion. For example, quantization occurs when a 24-bit recording (with 16 million possible values per sample), gets converted to 16-bit CD resolution (which has only 65,536 possible values).

What Are The Different Rates Of DSD?

DSD comes as the standard DSD64, double-rate DSD128, Quad-rate DSD256, and Octuple-rate DSD512.

The number at the end of the acronym signifies that it is 64, 128, 256, or 512 times the sample rate as a CD.

Are DSD Files Smaller than PCM files?

DSD records a 1-bit data stream at 2.88 MHz. This amounts to roughly 22 MB of disk space per minute. So, if you have a typical song around 3 minutes in length, that would only equate to 66 MB.

PCM files such as WAV use less space, however. For example, a 24-bit 48 kHz WAV file at the same length is about 40 MB while an MP3 file of that length would be about 4-8 MB.

Playback Options for DSD

What Is DSD Disc Format?

DSD discs are available through the use of specific recorders and appropriate tools.

These discs can be listened to via certain Sony audio hardware devices such as the PlayStation 3, as well as certain Sony laptops.

Can DSD Be Used With USB?

Yes, USB is an alternative to using discs with files burnt onto them for playback. In 2012, a lot of companies teamed up to develop a standard that detects DSD audio in PCM frames they titled DSD over PCM, or DoP.

DSD vs PCM

When comparing DSD vs PCM, it’s important to note that DSD audio has a higher noise floor than PCM audio, a more limited frequency range, and was based on an approach to DAC/ADC design that was substantially improved right after its release.

The other problem is it’s incredibly difficult to work with. In fact, to perform any kind of serious work with DSD, you have to convert it into PCM.

It really seems that in 90% of cases, your average SACD was recorded as PCM audio, mixed as PCM audio, and then converted to DSD audio.

Contradictory Results

The consistent theme when comparing PCM and DSD is contradictory beliefs and opinions, and this goes for scientific studies as well as personal tests.

One double-blind study in Germany found that hardly anybody could distinguish the difference between PCM and DSD audio.

However, in a 2014 study in Tokyo, the results concluded that listeners could distinguish 192 kHz, 24-bit PCM with DSD at 2.8 MHz and 5.6 MHz, but not between 2.8MHz and 5.6Mhz DSD. 

Consumer Market

DSD never achieved any level of success in the consumer market because post-production (which includes editing, mixing engineering, and mastering engineering) is extremely difficult due to the lack of necessary software.

DSD is still used, however, as a format for studio equipment in an archival manner as a possible replacement for analog tapes.

Conclusion

The quality of production, mixing, and mastering in most cases is 99% in the music itself, with that last 1 percent arguably being reliant upon the encoding format (in this case, DSD vs PCM).

Producing music in DSD format is extremely tiresome and near impossible. So unless you have a point to prove, stick to PCM.

If you’re only interested in listening to music, then DSD is worth investigating if you’re really fussy about audio formats.

If we had to pick a clear winner between the two, it would be PCM, largely thanks to its wide compatibility.

FAQ’s

What is SACD Versus DSD?

SACD (Super Audio CD) uses DSD encoding, except it is compatible with CD as well to make it more commercially viable.

It was intended to replace the Compact Disc (CD) in 1999 when it was introduced, with its multiple audio channel functionality.

The SACD layer uses a 1-bit DSD with a 4.7 GB disc capacity while the optional CD layer uses a 16-bit PCM with a 700 MB disc capacity (just like a regular CD).

What Is DST?

DST (Digital Stream Transfer) is a lossless data compression method used to reduce space and bandwidth within DSD.

DST compression reduces the file size by twice or three times and carries eighty minutes of sound.

What is ADC and DAC?

ADC is an analog to digital converter while DAC is a digital to analog converter. They are basically what the name implies.

An ADC converts an analog signal into a digital signal, while a DAC converts a digital signal into an analog audio signal.

DACs and ADCs are commonly used with music players but are also seen in televisions and phones.

What is Nyquist Theorem?

The Nyquist Theorem states that a digital sampling system must have a sample rate at least twice as high as the highest frequency of the audio that is being sampled.

Simply put, this is because you need at least two samples to generate an oscillation.

So with a 48 kHz sample rate, repeating two samples of opposite values would create a 24 kHz tone.

This is why for high-quality audio we use at least a 44.1 kHz sample rate. Because the highest frequency we can hear is 20 kHz, this gives us just enough room to fit the entire audible spectrum of sound.

Source: https://producerhive.com/ask-the-hive/does-dsd-sound-better-than-pcm/

10/17/2023

DSD vs. PCM: Myth vs. Truth

Introduction:

Direct Stream Digital (DSD) has become a big thing in high-end digital audio. Simplified encoding and decoding, along with ultra-high sampling frequencies, promise unparalleled performance. Is this what we’ve all been waiting for or just mass-marketing hype? This blog separates the hype from the technical facts. I’ll explain in what ways DSD has the advantage and in what ways pulse-code modulation (PCM) is better.

If you're not sure if you should believe the statements in this blog which contradict much of the marketing hype, myth, and legend in the audiophile industry, feel free to check the references at the end of this blog.

You also may want to refer to my other blog on “The 24-Bit Delusion.”


A Brief History:

In 1857, Édouard-Léon Scott de Martinville invented the phonautograph, which could graphically record sound waves. In early 1877, Charles Cros devised a way to reverse that process on a photoengraving to form a groove which could be traced by a stylus, causing vibrations that could be passed on to a diaphragm, recreating sound waves.

In late 1877, Thomas Edison used Cros’ theories to invent the cylinder phonograph, allowing music lovers to experience recorded music in their homes for the first time. Can you imagine a modern cylinder phonograph? Tangential tracking…no arc error…no skating error. The concept was flawless.

In 1887, Emile Berliner invented the technically inferior disk phonograph. Disks warp and there was arch error and skating errors introduced. Certainly no comparison to the tangential tracking Edison cylinder player.

But since disks are much cheaper to produce than cylinders, and since disk fit nicely in display bins at stores and can include larger cover art and notes, they became the standard. And so began the long history of the recorded music industry being more about consumer convenience and optimal profits than about optimal fidelity.

The digital revolution was no different. Philips and Sony collaborated on the new standard for a consumer digital format in 1979. Philips wanted a 20 cm disk, but Sony insisted on a 12 cm disk which could be played in a smaller portable device. In 1980 they published the Red Book CD-DA standard, and mass-market digital music was born. Many in the recording industry in the early days of digital joked that CD stood for “compromised disk.”

In the early 1980s, when digital recording became readily available, studios converted from analog to digital to save money. For studios, this cost less for the equipment, required less space for both recording and archiving, and made it easier to mix and edit tracks in post-production. For consumers, there weren't many advantages. Most of the early digital recordings were produced with relatively low resolution and sounded so fatiguing they would make you want to tear your ears off.

The switch from PCM to DSD was no different. In the early 1990s Sony wanted a future-proof, less expensive medium to archive their analog masters. In 1995 they concluded that storing a 1-bit signal directly from analog-to-digital would allow them to output to any conceivable consumer digital format (LOL...later I'll explain how Sony screwed the pooch on this decision). This new 1-bit technology was achieved by outputting from the monitoring pin on Crystal’s new 1-bit 2.8Mhz Bit Stream DAC chip.

Later, Sony’s consumer division caught wind of DSD and collaborated with Philips to create the SACD format. Of course, from the time the SACD was conceived until the time it came to market, DAC chip manufacturers had advanced from 64fs to a higher 128fs sampling rate (aka Double-Rate DSD) and from 1-bit to a higher-resolution 5-bit wide-DSD format. If the SACD format was DSD128 instead of DSD64 and 5-bits instead of 1-bit it would have made a huge difference in performance. Oops.

Long before the DVD, SACD, or DSD formats were developed, the Bit Stream DAC chip was introduced to the consumer market as a lower-cost alternative to the significantly more expensive R-2R multi-bit DAC chip. Bit Stream DAC chips have built-in algorithms to convert PCM input to DSD, which is then converted to analog. Once again, the result was a huge cost saving at the expense of fidelity.

It was in part Bit Stream DAC technology which allowed the development of our modern 7.1 channel audio that’s embedded into video formats. This also allowed electronics manufacturers to market DVD players in small chassis with cheap power supplies which could retail for under $70. Once again, the audio purist never stood a chance.

In contrast, not only do multi-bit R-2R DAC chips cost significantly more to manufacture than single-bit DAC chips, but they also require much larger and more sophisticated power supplies. If you were to make a 7.1 channel R-2R multi-disk player, it would cost several times the price of Bit Stream technology and it would be several times the size. Certainly not what the average consumer is looking for.

To sum things up, the recorded music industry has made decision after decision to maximize profits and mass consumer appeal at the expense of the audio purist. History lesson over.


DSD vs. PCM Technology:

PCM recordings are commercially available in 16-bit or 24-bit and in several sampling rates from 44.1KHz up to 192KHz. The most common format is the Red Book CD with 16-bits sampled at 44.1KHz. DSD recordings are commercially available in 1-bit with a sample rate of 2.8224MHz. This format is used for SACD and is also known as DSD64 or single-rate DSD.

There are more modern, higher-resolution 1-bit DSD formats, such as DSD128, DSD256, and DSD512 as well as wide-DSD formats with 5-bit to 8-bit Delta-Sigma decoding which I will explain later. These formats were created for recording studios and comprise only a very small portion of the recordings which are commercially available.

Though you can’t make a direct comparison between the resolution of DSD and PCM, various experts have tried. One estimate is that a 1-bit 2.8224MHz DSD64 SACD has similar resolution to a 20-bit 96KHz PCM. Another estimate is that a 1-bit 2.8224MHz DSD64 SACD is equal to 20-bit 141.12KHz PCM or 24-bit 117.6KHz PCM.

In other words a DSD64 SACD has much higher resolution than a 16-bit 44.1KHz Red Book CD, roughly the same resolution as 24-bit 88.2KHz PCM recording, and not as much resolution as a 24-bit 176.4KHz PCM recording.

Both DSD and PCM are “quantized,” meaning numeric values are set to approximate the analog signal. Both DSD and PCM have quantization errors. Both DSD and PCM have linearity errors. And both DSD and PCM have quantization noise that requires filtering at the output stage. In other words, neither one is perfect.

PCM encodes the amplitude of the analog signal sampled at uniform intervals (sort of like graph paper), and each sample is quantized to the nearest value within a range of digital steps. The range of steps is based on the bit depth of the recording. A 16-bit recording has 65,536 steps, a 20-bit recording has 1,048,576 steps, and a 24-bit recording has 16,777,216 steps.

The more bits and/or the higher the sampling rate used in quantization, the higher the theoretical resolution. So a 16-bit 44.1KHz Red Book CD has 28,901,376 sampling points each second (44,100 x 65,536). And a 24-bit 192KHz recording has 32,212,254,000,000 sampling points each second (192,000 x 16,777,216). This means 24-bit 192KHz recordings have over 111,455 times the theoretical resolution of a 16-bit 44.1KHz recording. No small difference.

So why is it that HD recordings sound only slightly better than a 16-bit 44.1KHz recordings made from identical masters? Later in this blog I’ll explain the difference between theoretical and actual resolution.

DSD encodes music using pulse-density modulation, a sequence of single-bit values at a sampling rate of 2.8224MHz. This translates to 64 times the Red Book CD sampling rate of 44.1KHz, but at only one 32,768th of its 16-bit resolution.

 

In the above graphical representation of PCM as a dual axis quantization, and DSD as a single axis quantization, you can see why the accuracy of DSD reproduction is so much more dependent on the accuracy of the clock than PCM. Of course, the accuracy of the voltage of each bit is just as important in DSD as PCM, so the regulation of the reference voltage is equally important in both types of converters.

Of course the accuracy of the clocking during the recording process which is done at several times the resolution of commercial DSD64 SACD and 16-bit 44.1KHz PCM recordings is significantly more important than the accuracy of the clocking of either DSD or PCM during playback.

There are other DSD formats which use higher sampling rates, such as DSD128 (aka Double-Rate DSD), with a sampling rate of 5.6448MHz; DSD256 (aka Quad-Rate DSD), with a sampling rate of 11.2896MHz; and DSD512 (aka Octuple-Rate DSD), with a sampling rate of 22.5792MHz. And most modern A to D and D to A Delta-Sigma converters do multibit wide-DSD with 5-bits to 8-bits decoding in parallel. All of these higher-resolution DSD formats were intended for studio use as opposed to consumer use, though there are some obscure companies selling recordings in these formats.

Note that Double, Quad, and Octuple DSD have both the potential for a 44.1KHz multiple and a 48KHz multiple sample rate for 100% equal division down to DSD64 SACD and 44.1KHz Red Book (both 44.1KHz multiples) or 96KHz and 192KHz High-Definition PCM formats (both 48KHz multiples).

Of course when studios convert a 48KHz multiple format to a 44.1KHz multiple format or visa versa they introduce quantization errors. Sadly this is often the case with older recordings when they are released in a remastered 24-bit 192KHz HD version derived from DSD64 masters, such as the ones Sony and other companies used to archive their analog masters in the mid-90's. Note that the optimal HD PCM format which can be created from a DSD64 master would be 24-bit 88.2KHz. Any sampling rate over 88.2KHz or that is equally divisible by 48KHz would have to be interpolated (not good). But consumers demand 24-bit 192KHz versions of all their old favorites, so companies provide them, despite the known consequences.


The Problems:

There are three major areas where both PCM and DSD fall short of perfection: quantization errors, quantization noise, and non-linearity.

Quantization errors can occur in several ways. One way which was most common in the early days of digital recording had to do with the resolution being too low. Think of the intersection points on a piece of graph paper. You can’t quantize to a fraction of a bit, and you can’t quantize to a fraction of a sampling rate. You can only quantize to a value which falls on the intersection points of bit-depth and sampling rate. When the value of the analog signal falls between two quantization values, the digital recording ends up recreating the sound lower or higher in volume and/or slower or faster in frequency, distorting the time, tune, and amplitude of the original music. Often this creates unnatural, odd harmonics which result in the hard, fatiguing sound associated with early digital recordings. Note on the graphic below that the solid blue line represents the actual music wave and the black dots represent the closest quantization values.

Though modern sampling rates are high enough to fool the human ear, quantization errors still occur when translating from one format to another. For example, when Sony decided to archive their analog master libraries to DSD64 back in 1995, they were wrong to believe that these masters would be future-proof and able to reproduce any consumer format. The fact is, these masters could only properly reproduce a format that was divisible by 44.1KHz. So any modern 96KHz or 192KHz recording created from DSD64 master files have quantization errors.

This leads me to one of the many things that enrage me about the recorded entertainment industry. If 44.1KHz was the standard which was engineered to put aliasing errors in less critical audio frequencies, then why did they start using multiples of 48KHz?!?!?!? All they had to do was go with 88.2KHz and 176.4KHz as the modern HD consumer formats, and all of this mess could have been avoided. They made DXD, a 24-bit 352.8KHz studio format, equally divisible by 44.1KHz. What blithering idiot decided to put a wrench in the works with 96KHz and 192KHz HD audio?!?!?!?

The actual reason for the 48KHz multiple has to do with optimal synchronizing to video. So it makes sense to have sound tracks from movies recorded in a 48KHz multiple, such as the 24-bit 96KHz format embedded into 7.1 channel audio on DVDs and Blu-Rays. But since over 90% of all music recordings are sold in a 44.1KHz for Red Book CD or DSD64 SACD it is rather ridiculous to offer any HD music in 96KHz or 192KHz as opposed to the optimal 88.2KHz and 176.4KHz HD formats. But because naive consumers wrongly believe that the higher the sampling rate the higher the fidelity they demand 192Khz falsely believing it is better than 176.4KHz, so that is what record companies market.

Quantization noise is unavoidable. No matter what format you digitize in, ultrasonic artifacts are created. The more bits you have, the lower the noise floor. Noise floor is lowered by roughly 6db for each bit. So as you can imagine, 1-bit DSD has significantly more ultrasonic noise than even 16-bit PCM. This is part of why wide-DSD formats with 5-bit to 8-bit parallel Delta-Sigma decoding were created. With PCM, you have to deal with significant noise at the sampling frequency. This is why Sony and Philips engineered the Red Book CD to sample at 44.1KHz, which is over twice the human high-frequency hearing limit of 20KHz.

Since quantization noise is present around the sampling frequency of a PCM recording, a 44.1KHz recording has quantization noise one octave above the human hearing limit of 20KHz. This quantization noise needs to be filtered out, so all DACs have a low-pass filter at the output. Because the quantization noise is only one octave above audibility the filters used have a very steep slope so as to not filter out desirable high frequencies. These steeply sloped low-pass digital filters are commonly known as "brick wall" filters. This is why there can be an advantage in playing 44.1KHz PCM upsampled to 88.2KHz or 176.4KHz.

Though you hear a lot about "brick wall" filters causing an audible distortion in the top end of early Red Book CD players , the fact is that was only a small part of the reason early Red Book CDs and players had an unnatural sounding top end. Most of the hard, harsh, unnatural sounding high frequencies in early digital had more to do with flaws in the power supplies and flaws in the recording process, not "brick wall" filters.

Sorry to be the one to burst your bubble, but despite what many audiophiles may believe, less than one person in a thousand can hear anything above 20KHz as a child and there is almost no one over the age of 40 who can hear much above 15KHz.

Of course DSD64 is another story: above 25KHz the quantization noise rises sharply, requiring far more sophisticated filters and/or noise-shaping algorithms. See graphic below. When you filter the output of DSD64 with a simple low-pass filter, the result is distorted phase/time and some rather nasty artifacts in the audible range. The solution is noise-shaping algorithms which move the noise to less audible frequencies and/or higher sampling rates. This is why Double-Rate DSD and Quad-Rate DSD formats came into being. This is also why advanced player software, such as JRiver, offers Double-Rate DSD output. Using player software that upsamples DSD64 to DSD128 or DSD256 significantly improves performance by putting the digital artifacts octaves above audibility allowing more advanced noise-shaping algorithms and less severe digital filters. Note these extremely high sampling frequencies are why ultra accurate clocking is more important in the playback of DSD than PCM recordings.

Jitter is defined as inconsistencies in playback frequency caused by inaccurate clocking. The result is observable as distortion of the time and tune of the music. Often the pattern of the inconsistency of frequency can result in an analog wave form that has an unnatural odd harmonic frequency. This results in the fatiguing character commonly known as “digititis.” Note in the two graphs below: jitter is an inconsistency in the horizontal time axis and non-linearity is an inconsistency in the vertical amplitude axis.

  

Jitter occurs when the converter’s clock rate is inconsistent and non-linearity can occur when the converter's reference voltage is inconsistent. This is why we are hearing so much about “super clocks” and “femto clocks.” The more accurate the clock, the more accurate the analog output. This is also why ultrahigh-performance R-2R DACs, such as Mojo Audio’s Mystique, have a way to adjust the voltage of the most-significant-bit (MSB) at the zero crossing to optimize linearity.


The Myth of Pure DSD:

Despite the marketing hype, there are almost no pure DSD recordings available to consumers. This is partially because up until quite recently there was no way to edit, mix, and master DSD files. So most pure DSD recordings which are commercially available are those recorded direct to DSD without any post-production. There are some new studio software packages which can edit, mix, and master in DSD, but these are quite rare in the industry, and mostly used by small boutique recording companies. Most DSD recordings are in fact, edited, mixed, and mastered in PCM and then converted back to DSD. The marketing hype DSD flow chart you see below rarely exists anywhere but in theory. Yikes…the secret is out.

There are several generations and levels of quality in purely digital DSD recordings. The least pure are DSD recordings made from old PCM masters. Many of these PCM masters had low resolution as well as significantly higher quantization errors and lower linearity than modern PCM recordings. Since you can never get better than the original masters, these DSD recordings sound as bad as or worse than the original low-resolution PCM masters. The purest common DSD recordings come from modern DSD masters which are recorded in 5-bit to 8-bit Wide-DSD, which is in fact a 5-bit to 8-bit parallel Delta-Sigma encoding.

As you can see from the above flow chart, most commercially available DSD recordings have to be converted back and forth to a PCM format in order to do post-production editing, mixing, and mastering. In each of these conversions, more quantization noise and/or quantization errors are added to the recording. For that reason they created these inaudible resolution 24-bit and Wide-DSD formats with insanely high sampling rates. The higher the resolution during editing, mixing, and mastering, the lower the digital noise in the audible spectrum when these recordings are downsampled to commercially available formats.

It is quite unlikely that any or many of recording studios that are currently using Wide-DSD for editing, mixing, and mastering will ever upgrade to software that can edit, mix, and master in true DSD, since DSD is in fact an obsolete format. Even Sony no longer supports DSD and SACD. The modern format which recording studios will likely be upgrading to would be MQA, which compresses much better than DSD or PCM for streaming and decodes to PCM formats, such as 24-bit 88.2KHz. That is why HD music streaming services such as Qobuz and Tidal are switching over to MQA for their ultra-HD selections. So with the invention of MQA compression, PCM is quickly becoming the preferred HD music format.

Another common marketing myth about DSD vs. PCM is that when blind listening tests were done comparing DSD to PCM, there was a consensus that PCM had a fatiguing quality and DSD had a more analog-like quality. This was proved to be total marketing BS. One way that marketing lie was perpetuated was with hybrid SACDs which have DSD64 and 16-bit 44.1KHz PCM on the same disk. The DSD64 tracks have over 30 times the resolution of the 16-bit 44.1KHz tracks so that they could make DSD sound better than PCM in comparisons. The truth is that in recent blind studies they've proved that high-resolution PCM and DSD are statistically indistinguishable from one another. Considering that nearly all DSD recordings were edited, mixed, and mastered in PCM, it is no wonder.

Then there are the differences in the ways DAC chips work. Most modern DAC chips are Delta-Sigma which decode native DSD. R-2R DAC chips decode native PCM. In order for you to play PCM files on a Delta-Sigma DAC or DSD files on an R-2R DAC the files have to be converted in real time.

Most modern Delta-Sigma DAC chips can decode multiple file formats, including PCM, DSD, and Wide-DSD. When they are decoding PCM, a Delta-Sigma DAC chip has to first convert it into DSD, the chip's native format. Another reason for the common misconception that DSD performs better than PCM has to do with the poor quality of the real-time PCM to DSD converters built into native DSD Delta-Sigma DACs. Since R-2R ladder DAC chips can only decode PCM formats some DAC manufacturers use chips or FPGAs at the input stages of their DACs which convert DSD to PCM. But no R-2R DAC chip can decode DSD on its own.

In almost all cases I would recommend playing music files in the native format which your DAC chip decodes. That would be PCM for an R-2R DAC chip and DSD for a Delta-Sigma DAC chip. There are several brands of player software on the market which have real-time PCM to Double-Rate DSD converters. HQ Player is one of the most sophisticated player software packages on the market today. HQ Player can be configured for real-time PCM to DSD conversion as well as real-time DSD upsampling to Double, Quad, Octuple, and even higher rate DSD formats. Using player software that is capable of converting PCM to DSD and upsampling it to at least Quad-Rate DSD is highly recommended.


Summary:

Well, all that’s a real ear opener, isn’t it?

When people claim to hear significant differences between PCM and DSD it is not the difference between the formats that they are hearing, but most often the difference in the quality of the digital remastering or the native format their specific DAC decodes. Delta-Sigma DACs decode native DSD and R-2R DACs decode native PCM.

Keep in mind that most recordings are engineered to sound best on a car stereo or portable device as opposed to on a high-end audiophile system. It’s a well-known fact that artists and producers will often listen to tracks on an MP3 player or car stereo before approving the final mix.

The quality of the recording plays a far more significant role than the format or resolution it is distributed in. But to increase profits, many modern recording studio executives insist that errors be edited out in post-production, significantly compromising the quality of the original master tapes. So no matter what format these recordings are released in, the music will always sound mediocre, since you can never have higher performance than what is on the original masters.

In contrast, some of my favorite digital recordings were digitally mastered from 1950s analog recordings. Many of these recordings were done as a group of musicians playing in a room with one take per track and no post-production editing. Though these recordings have much higher background noise being limited by old-school pre-Dolby 60dB dynamic range master tape, they retain an organic character and in-the-room harmonic cues that can't be duplicated any other way.


Hear It for Yourself:

Are you curious about the potential of digital-to-analog conversion?

Mojo Audio’s Mystique EVO DAC has the purest digital conversion possible.

  • A true non-oversampling R-2R ladder DAC design.
  • No noise-shaping, upsampling, or oversampling algorithms.
  • MSB zero-crossing voltage adjustment circuitry to optimize linearity.
  • Perfectly bit-aligned left and right channel hardware-based demultiplexing.
  • Direct-coupled with no output capacitors or transformers to distort phase and time or narrow bandwidth.
  • LC choke-input power supplies, which unlike capacitive power supplies, store both current and voltage.

The Mystique is in a class by itself. Explosive micro-dynamics combined with harmonically coherent micro-details reveal the true time, tune, tone, and timbre of the original musical performance.

With Mojo Audio’s 45-day no-risk audition you can hear the Mystique DAC for yourself, in your own system, with no-risk and no restocking fees. Experience all the harmonic coherency and emotional content digital music is capable of delivering.

If you like what you've read in this blog and are interested in getting more free tips and tricks, check out the rest of my blogs on our website. Also, sign up for our e-newsletter to get more useful info as well as discount coupons, special offers, and first looks at new products.

Enjoy!

Benjamin Zwickel
Owner, Mojo Audio

Source: https://www.mojo-audio.com/blog/dsd-vs-pcm-myth-vs-truth/