General measurements
Download the slides for this lesson.
Introduction
Measurements provide an interface between quantum and classical information. When a measurement is performed on a system in a quantum state, classical information is extracted, revealing something about that quantum state — and generally changing or destroying it in the process. In the simplified formulation of quantum information (as presented in the Basics of quantum information course), we typically limit our attention to projective measurements, including the simplest type of measurement: standard basis measurements. The concept of a measurement can, however, be generalized beyond projective measurements.
In this lesson we'll consider measurements in greater generality. We'll discuss a few different ways that general measurements can be described in mathematical terms, and we'll connect them to concepts discussed previously in the course.
We'll also take a look at a couple of notions connected with measurements, namely quantum state discrimination and quantum state tomography. Quantum state discrimination refers to a situation that arises commonly in quantum computing and cryptography, where a system is prepared in one of a known collection of states, and the goal is to determine, by means of a measurement, which state was prepared. For quantum state tomography, on the other hand, many independent copies of a single, unknown quantum state are made available, and the goal is to reconstruct a density matrix description of that state by performing measurements on the copies.
Mathematical formulations of measurements
The lesson begins with two equivalent mathematical descriptions of measurements:
- General measurements can be described by collections of matrices, one for each measurement outcome, in a way that generalizes the description of projective measurements.
- General measurements can be described as channels whose outputs are always classical states (represented by diagonal density matrices).
We'll restrict our attention to measurements having finitely many possible outcomes. Although it is possible to define measurements with infinitely many possible outcomes, they're much less typically encountered in the context of computation and information processing, and they also require some additional mathematics (namely measure theory) to be properly formalized.
Our initial focus will be on so-called destructive measurements, where the output of the measurement is a classical measurement outcome alone — with no specification of the post-measurement quantum state of whatever system was measured. Intuitively speaking, we can imagine that such a measurement destroys the quantum system itself, or that the system is immediately discarded once the measurement is made. Later in the lesson we'll broaden our view and consider non-destructive measurements, where there's both a classical measurement outcome and a post-measurement quantum state of the measured system.
Measurements as collections of matrices
Suppose is a system that is to be measured, and assume for simplicity that the classical state set of is for some positive integer so that density matrices representing quantum states of are matrices. We won't actually have much need to refer to the classical states of but it will be convenient to refer to the number of classical states of We'll also assume that the possible outcomes of the measurement are the integers for some positive integer Note that we're just using these names to keep things simple; it's straightforward to generalize everything that follows to other finite sets of classical states and measurement outcomes, renaming them as desired.
Recall that a projective measurement is described by a collection of projection matrices that sum to the identity matrix. In symbols, describes a projective measurement of if each is an projection matrix and the following condition is met.
When such a measurement is performed on a system while it's in a state described by some quantum state vector each outcome is obtained with probability equal to (We also have that the post-measurement state of is obtained by normalizing the vector but we're ignoring the post-measurement state for now.)
If the state of is described by a density matrix rather than a quantum state vector then we can alternatively express the probability to obtain the outcome as If is a pure state, then the two expressions are equal:
Here we're using the cyclic property of the trace for the second equality, and for the third equality we're using the fact that each is a projection matrix, and therefore satisfies In general, if is a convex combination
of pure states, then the expression coincides with the average probability for the outcome owing to the fact that this expression is linear in
A mathematical description for general measurements is obtained by relaxing the definition of projective measurements. Specifically, we allow the matrices in the collection describing the measurement to be arbitrary positive semidefinite matrices rather than projections. (Projections are always positive semidefinite; they can alternatively be defined as positive semidefinite matrices whose eigenvalues are all either 0 or 1.) In particular, a general measurement of a system having outcomes is specified by a collection of positive semidefinite matrices whose rows and columns correspond to the classical states of and that meet the condition
If the system is measured while it is in a state described by the density matrix then each outcome appears with probability
As we must naturally demand, the vector of outcome probabilities
of a general measurement always forms a probability vector, for any choice of a density matrix The following two observations establish that this is the case.
Each value must be nonnegative, owing to the fact that the trace of the product of any two positive semidefinite matrices is always nonnegative:
One way to argue this fact is to use spectral decompositions of and together with the cyclic property of the trace to express the trace of the product as a sum of nonnegative real numbers, which must therefore be nonnegative.
The condition together with the linearity of the trace ensures that the probabilities sum to
Example 1: any projective measurement
Projections are always positive semidefinite, so every projective measurement is an example of a general measurement.
For example, a standard basis measurement of a qubit can be represented by where
Measuring a qubit in the state results in outcome probabilities as follows.
Example 2: a non-projective qubit measurement
Suppose is a qubit, and define two matrices as follows.
These are both positive semidefinite matrices: they're Hermitian, and in both cases the eigenvalues happen to be which are both positive. We also have that and therefore describes a measurement.
If the state of is described by a density matrix and we perform this measurement, then the probability of obtaining the outcome is and the probability of obtaining the outcome is For instance, if then the probabilities for the two outcomes and are as follows.
Example 3: tetrahedral measurement
Define four single-qubit quantum state vectors as follows.
These four states are sometimes known as tetrahedral states because they're vertices of a regular tetrahedron inscribed within the Bloch sphere.
The Cartesian coordinates of these four states on the Bloch sphere are
which can be verified by expressing the density matrices representations of these states as linear combinations of Pauli matrices.
These four states are perfectly spread out on the Bloch sphere, each one equidistant from the other three and with the angles between any two of them always being the same.
Now let us define a measurement of a qubit by setting as follows for each
We can verify that this is a valid measurement as follows.
- Each is evidently positive semidefinite, being a pure state divided by one-half. That is, each one is a Hermitian matrix and has one eigenvalue equal to and all other eigenvalues zero.
- The sum of these matrices is the identity matrix: The expressions of these matrices as linear combinations of Pauli matrices makes this straightforward to verify.
Measurements as channels
A second way to describe measurements in mathematical terms is as channels.
Classical information can be viewed as a special case of quantum information, insofar as we can identify probabilistic states with diagonal density matrices. So, in operational terms, we can think about measurements as being channels whose inputs are matrices describing states of whatever system is being measured and whose outputs are diagonal density matrices describing the resulting distribution of measurement outcomes.
We'll see shortly that any channel having this property can always be written in a simple, canonical form that ties directly to the description of measurements as collections of positive semidefinite matrices. Conversely, given an arbitrary measurement as a collection of matrices, there's always a valid channel having the diagonal output property that describes the given measurement as suggested in the previous paragraph. Putting these observations together, we find that the two descriptions of general measurements are equivalent.
Before proceeding further, let's be more precise about the measurement, how we're viewing it as a channel, and what assumptions we're making about it. As before, we'll suppose that is the system to be measured, and that the possible outcomes of the measurement are the integers for some positive integer We let be the system that stores measurement outcomes, so its classical state set is and we represent the measurement as a channel named from to Our assumption is that is classical — which is to say that no matter what state we start with for the state of we obtain is represented by a diagonal density matrix.
We can express in mathematical terms that the output of is always diagonal in the following way. First define the completely dephasing channel on
This channel is analogous to the completely dephasing qubit channel from the previous lesson. As a linear mapping, it zeros out all of the off-diagonal entries of an input matrix and leaves the diagonal alone. And now, a simple way to express that a given density matrix is diagonal is by the equation In words, zeroing out all of the off-diagonal entries of a density matrix has no effect if and only if the off-diagonal entries were all zero to begin with. The channel therefore satisfies our assumption — that is classical — if and only if for every density matrix representing a state of
Equivalence of the formulations
Channels to matrices
Suppose that we have a channel from to with the property that for every density matrix This may alternatively be expressed as follows.
Like all channels, we can express in Kraus form for some way of choosing Kraus matrices
This provides us with an alternative expression for the diagonal entries of
for
Thus, for these same matrices we can express the channel as follows.
This expression is consistent with our description of general measurements in terms of matrices, as we see each measurement outcome appearing with probability
Now let's observe that the two properties required of the collection of matrices to describe a general measurement are indeed satisfied. The first property is that they're all positive semidefinite matrices. One way to see this is to observe that, for every vector having entries in correspondence with the classical state of we have
The second property is that if we sum these matrices we get the identity matrix.
The last equality follows from the fact that is a channel, so its Kraus matrices must satisfy this condition.
Matrices to channels
Now let's verify that for any collection of positive semidefinite matrices satisfying the mapping defined by
is indeed a valid channel from to
One way to do this is to compute the Choi representation of this mapping.
The transpose of each is introduced for the third equality because This allows for the expressions and to appear, which simplify to the identity matrix upon summing over and respectively.
By the assumption that are positive semidefinite, so too are (Transposing a Hermitian matrix results in another Hermitian matrix, and the eigenvalues of any square matrix and its transpose always agree.) It follows that is positive semidefinite. Tracing out the output system (which is the system on the right) yields
and so we conclude that is a channel.
Partial measurements
Suppose that we have multiple systems that are collectively in a quantum state, and a general measurement is performed on one of the systems. This results in one of the measurement outcomes, selected at random according to probabilities determined by the measurement and the state of the system prior to the measurement. The resulting state of the remaining systems will then, in general, depend on which measurement outcome was obtained.
Let's examine how this works for a pair of systems when the system is measured. (We're naming the system on the right because we'll take to be a system representing the classical output of the measurement when we view it as a channel.) We can then easily generalize to the situation in which the systems are swapped as well as to three or more systems.
Suppose the state of prior to the measurement is described by a density matrix which we can write as follows.
In this expression we're assuming the classical states of are
We'll assume that the measurement itself is described by the collection of matrices This measurement may alternatively be described as a channel from to where is a new system having classical state set Specifically, the action of this channel can be expressed as follows.
Outcome probabilities
We're considering a measurement of the system so the probabilities with which different measurement outcomes are obtained can depend only on the reduced state of In particular, the probability for each outcome to appear can be expressed in three equivalent ways.
The first expression naturally represents the probability to obtain the outcome based on what we already know about measurements of a single system. To get the second expression we're simply using the definition To get the third one requires more thought — and learners are encouraged to convince themselves that it is true. (Hint: the equivalence between the second and third expressions does not depend on being a density matrix or on each being positive semidefinite. Try showing it first for tensor products of the form and then conclude that it must be true in general by linearity.)
While the equivalence of the first and third expressions in the previous equation may not be immediate, it does make sense. Starting from a measurement on we're effectively defining a measurement of where we simply throw away and measure Like all measurements, this new measurement can be described by a collection of matrices, and it's not surprising that this measurement is described by the collection
States conditioned on measurement outcomes
If we want to determine not only the probabilities for the different outcomes but also the resulting state of conditioned on each measurement outcome, we can look to the channel description of the measurement. In particular, let's examine the state we get when we apply to and do nothing to
Note that this is a density matrix by virtue of the fact that is a channel, so each matrix is necessarily positive semidefinite.
One final step transforms this expression into one that reveals what we're looking for.
This is an example of a classical-quantum state,
like we saw in the Density matrices lesson. For each measurement outcome we have with probability that is in the classical state and is in the state
That is, this is the density matrix we obtain by normalizing by dividing it by its trace. (Formally speaking, the state is only defined when the probability is nonzero; when this state is irrelevant, for it refers to a discrete event that occurs with probability zero.) Naturally the outcome probabilities are consistent with our previous observations.
In summary, this is what happens when the measurement is performed on when is in the state
- Each outcome appears with probability
- Conditioned on obtaining the outcome the state of is then represented by the density matrix shown in the equation (2), which is obtained by normalizing
Generalization
We can adapt this description to other situations, such as when the ordering of the systems is reversed or when there are three or more systems. Conceptually it is straightforward, although it can become cumbersome to write down the formulas.
In general, if we have systems the state of the compound system is and the measurement is performed on , the following happens.
Each outcome appears with probability
Conditioned on obtaining the outcome the state of is then represented by the following density matrix.
Naimark's theorem
Naimark's theorem is a fundamental fact concerning measurements. It states that every general measurement can be implemented in a simple way that's reminiscent of Stinespring representations of channels: the system to be measured is first combined with an initialized workspace system, forming a compound system; then a unitary operation is performed on the compound system; and finally the workspace system is measured with respect to a standard basis measurement, yielding the outcome of the original general measurement.
Theorem statement and proof
Let be a system and let be a collection of positive semidefinite matrices satisfying that describes a measurement of Also let be a system whose classical state set is which is the set of possible outcomes of the given measurement.
Naimark's theorem states that there exists a unitary operation on the compound system so that the implementation suggested by the following figure yields measurement outcomes that agree with the given measurement meaning that the probabilities for the different possible measurement outcomes are precisely in agreement.
To be clear, the system starts out in some arbitrary state while is initialized to the state. The unitary operation is applied to and then the system is measured with a standard basis measurement, yielding some outcome Note that, in the figure, the system is pictured as part of the output of the circuit — but for now we won't concern ourselves with the state of after is performed, and we can alternatively imagine that it's traced out. An implementation of a measurement in this way is clearly reminiscent of a Stinespring representation of a channel, and the mathematical underpinnings are similar as well. The difference here is that the workspace system is measured rather than being traced out like in the case of a Stinespring representation.
The fact that every measurement can be implemented in this way is pretty simple to prove, but we're going to need a fact concerning positive semidefinite matrices first.
Theorem. Suppose is an positive semidefinite matrix. There exists a unique positive semidefinite matrix for which This unique positive semidefinite matrix is called the square root of and is denoted
One way to find the square root of a positive semidefinite matrix is to first compute a spectral decomposition.
Because is positive semidefinite, its eigenvalues must be nonnegative real numbers, and by replacing them with their square roots we obtain an expression for the square root of
With this concept in hand, we're ready to prove Naimark's theorem. Under the assumption that has classical states, a unitary operation on the pair can be represented by an matrix, which we can view as an block matrix whose blocks are The key to the proof is to take to be any unitary matrix that matches the following pattern.
For it to be possible to fill in the blocks marked with a question mark so that is unitary, it's both necessary and sufficient that the first columns, which are formed by the blocks are orthonormal. We can then use the Gram-Schmidt orthogonalization process to fill in the remaining columns, as we've already seen a couple of times in this series.
The first columns of can be expressed as vectors in the following way, where refers to the column number starting from
We can compute the inner product between any two of them as follows.
This shows that these columns are in fact orthonormal, so we can fill in the remaining columns of in a way that guarantees the entire matrix is unitary.
It remains to check that the measurement outcome probabilities for the simulation are consistent with the original measurement. For a given initial state of the measurement described by the collection results in each outcome with probability
To obtain the outcome probabilities for the simulation, let's first give the name to the state of after has been performed. This state can be expressed as follows.
Equivalently, in a block matrix form, we have the following equation.
Notice that the entries of falling into the blocks marked with a question mark have no influence on the outcome by virtue of the fact that we're conjugating a matrix of the form — the question mark entries are always multiplied by zero entries of when the matrix product is computed.
Now we can analyze what happens when a standard basis measurement is performed on The probabilities of the possible outcomes are given by the diagonal entries of the reduced state of
In particular, using the cyclic property of the trace, we see that the probability to obtain a given outcome is as follows.
This matches with the original measurement, establishing the correctness of the simulation.
Non-destructive measurements
So far in the lesson, we've concerned ourselves with destructive measurements, where the output consists of the classical measurement result alone and there is no specification of the post-measurement quantum state of the system that was measured. Non-destructive measurements, on the other hand, do precisely this. Specifically, non-destructive measurements describe not only the classical measurement outcome probabilities, but also the state of the system that was measured conditioned on each possible measurement outcome. Note that the term non-destructive refers to the system being measured but not necessarily its state, which could change significantly as a result of the measurement.
In general, for a given destructive measurement, there will be multiple (in fact infinitely many) non-destructive measurements that are compatible with the given destructive measurement, meaning that the classical measurement outcome probabilities match precisely with the destructive measurement. So, there isn't a unique way to define the post-measurement quantum state of a system for a given measurement. It is, in fact, possible to generalize non-destructive measurements even further, so that they produce a classical measurement outcome along with a quantum state output of a system that isn't necessarily the same as the input system.
The notion of a non-destructive measurement is an interesting and useful abstraction. It should, however, be recognized that non-destructive measurements can always be described as compositions of channels and destructive measurements — so there is a sense in which the notion of a destructive measurement is the more fundamental one.
From Naimark's theorem
Consider the simulation of a general measurement like we have in Naimark's theorem. A simple way to obtain a non-destructive measurement from this simulation is revealed by the figure from before, where the system is not traced out, but is part of the output. This yields both a classical measurement outcome as well as a post-measurement quantum state of
Let's describe these states in mathematical terms. We're assuming that the initial state of is so that after the initialized system is introduced and is performed, we have that is in the state
The probabilities for the different classical outcomes to appear are the same as before — they can't change as a result of us deciding to ignore or not ignore That is, we obtain each with probability
Conditioned upon having obtained a particular measurement outcome the resulting state of is given by this expression.
One way to see this is to represent a standard basis measurement of by the completely dephasing channel where the channel output describes classical measurement outcomes as (diagonal) density matrices. An expression of the state we obtain follows.
We can then write this state as a convex combination of product states,
which is consistent with the expression we've obtained for the state of conditioned on each possible measurement outcome.
From a Kraus representation
There are alternative selections for in the context of Naimark's theorem that produce the same measurement outcome probabilities but give entirely different output states of
For instance, one option is to substitute for where is any unitary operation on The application of to commutes with the measurement of so the classical outcome probabilities do not change, but now the state of conditioned on the outcome becomes
More generally, we could replace by the unitary matrix
for any choice of unitary operations on Again, the classical outcome probabilities are unchanged, but now the state of conditioned on the outcome becomes
An equivalent way to express this freedom is connected with Kraus representations. That is, we can describe an -outcome non-destructive measurement of a system having classical states by a selection of Kraus matrices satisfying the typical condition for Kraus matrices.
Assuming that the initial state of is the classical measurement outcome is with probability
and conditioned upon the outcome being the state of becomes
Note that this is equivalent to choosing the unitary operation in Naimark's theorem as follows.
In the previous lesson we observed that the columns formed by the blocks are necessarily orthogonal, by virtue of the condition
Generalizations
There are even more general ways to formulate non-destructive measurements than the ways we've discussed. The notion of a quantum instrument (which won't be described here) represents one way to do this.
Quantum state discrimination and tomography
In the last part of the lesson, we'll briefly consider two tasks associated with measurements: quantum state discrimination and quantum state tomography.
Quantum state discrimination
For quantum state discrimination, we have a known collection of quantum states along with probabilities associated with these states. (A succinct way of expressing this is to say that we have an ensemble of quantum states.) A number is chosen randomly according to the probabilities and the system is prepared in the state The goal is to determine, by means of a measurement of alone, which value of was chosen.
Thus, we have a finite number of alternatives, along with a prior — which is our knowledge of the probability for each to be selected — and the goal is to determine which alternative actually happened. This may be easy for some choices of states and probabilities, and for others, it may not be possible without some chance of making an error.
Quantum state tomography
For quantum state tomography, we have an unknown quantum state of a system — so unlike in quantum state discrimination there's typically no prior or any information about possible alternatives. This time, however, it's not a single copy of the state that's made available, but rather many independent copies are made available. That is, identical systems are each independently prepared in the state for some (possibly large) number The goal is to find an approximation of the unknown state, as a density matrix, by measuring the systems.
Discriminating between two states
The simplest case for quantum state discrimination is that there are two states to be discriminated: and
Imagine a situation which a bit is chosen randomly: with probability and with probability A system is prepared in the state meaning or depending on the value of and given to us. It is our goal to correctly guess the value of by means of a measurement on To be precise, we shall aim to maximize the probability that our guess is correct.
An optimal measurement
An optimal way to solve this problem begins with a spectral decomposition of a weighted difference between and where the weights are the corresponding probabilities.
Notice that we have a minus sign rather than a plus sign in this expression: this is a weighted difference not a weighted sum.
We can maximize the probability of a correct guess by selecting a projective measurement as follows. First let's partition the elements of into two disjoint sets and depending upon whether the corresponding eigenvalue of the weighted difference is nonnegative or negative.
We can then choose a projective measurement as follows.
(It doesn't actually matter in which set or we include the values of for which Here we're choosing arbitrarily to include these values in )
This is an optimal measurement in the situation at hand that minimizes the probability of an incorrect determination of the selected state.
Correctness probability
Now we will determine the probability of correctness for the measurement
To begin we don't really need to be concerned with the specific choice we've made for and though it may be helpful to keep it in mind. For any measurement (not necessarily projective) we can write the correctness probability as follows.
Using the fact that is a measurement, so we can rewrite this expression as follows.
On the other hand, we could have made the substitution instead. That wouldn't change the value but it does give us an alternative expression.
The two expressions have the same value, so we can average them to give yet another expression for this value. (Averaging the two expressions is just a trick to simplify the resulting expression.)
Now we can see why it makes sense to choose the projections and (as specified above) for and respectively — because that's how we can make the trace in the final expression as large as possible. In particular,
So, when we take the trace, we obtain the sum of the absolute values of the eigenvalues — which is equal to what's known as the trace norm of the weighted difference.
Thus, the probability that the measurement leads to a correct discrimination of and given with probabilities and respectively, is as follows.
The fact that this is the optimal probability for a correct discrimination of and given with probabilities and is commonly referred to as the Helstrom-Holevo theorem (or sometimes just Helstrom's theorem).
Discriminating three or more states
For quantum state discrimination when there are three or more states, there is no known closed-form solution for an optimal measurement, although it is possible to formulate the problem as a semidefinite program — which allows for efficient numerical approximations of optimal measurements with the help of a computer.
It is also possible to verify (or falsify) optimality of a given measurement in a state discrimination task through a condition known as the Holevo-Yuen-Kennedy-Lax condition. In particular, for the state discrimination task defined by the ensemble the measurement is optimal if and only if the matrix
is positive semidefinite for every
For example, consider the quantum state discrimination task in which one of the four tetrahedral states is selected uniformly at random. The tetrahedral measurement succeeds with probability
This is optimal by the Holevo-Yuen-Kennedy-Lax condition, as a calculation reveals that for
Quantum state tomography
Finally, we'll briefly discuss the problem of quantum state tomography. For this problem, we're given a large number of independent copies of an unknown quantum state and the goal is to reconstruct an approximation of To be clear, this means that we wish to find a classical description of a density matrix that is as close as possible to
We can alternatively describe the set-up in the following way. An unknown density matrix is selected, and we're given access to quantum systems each of which has been independently prepared in the state Thus, the state of the compound system is
The goal is to perform measurements on the systems and, based on the outcomes of those measurements, to compute a density matrix that closely approximates This turns out to be a fascinating problem and there is ongoing research on it.
Different types of strategies for approaching the problem may be considered. For example, we can imagine a strategy where each of the systems is measured separately, in turn, producing a sequence of measurement outcomes. Different specific choices for which measurements are performed can be made, including adaptive and non-adaptive selections. In other words, the choice of what measurement is performed on a particular system might or might not depend on the outcomes of prior measurements. Based on the sequence of measurement outcomes, a guess for the state is derived — and again there are different methodologies for doing this.
An alternative approach is to perform a single joint measurement of the entire collection, where we think about as a single system and select a single measurement whose output is a guess for the state This can lead to an improved estimate over what is possible for separate measurements of the individual systems, although a joint measurement on all of the systems together is likely to be much more difficult to implement.
Qubit tomography using Pauli measurements
We'll now consider quantum state tomography in the simple case where is a qubit density matrix. We assume that we're given qubits that are each independently in the state and our goal is to compute an approximation that is close to
Our strategy will be to divide the qubits into three roughly equal-size collections, one for each of the three Pauli matrices and Each qubit is then measured independently as follows.
For each of the qubits in the collection associated with we perform a measurement. This means that the qubit is measured with respect to the basis which is an orthonormal basis of eigenvectors of and the corresponding measurement outcomes are the eigenvalues associated with the two eigenvectors: for the state and for the state By averaging together the outcomes over all of the states in the collection associated with we obtain an approximation of the expectation value
For each of the qubits in the collection associated with we perform a measurement. Such a measurement is similar to a measurement, except that the measurement basis is the eigenvectors of Averaging the outcomes over all of the states in the collection associated with we obtain an approximation of the expectation value
For each of the qubits in the collection associated with we perform a measurement. This time the measurement basis is the standard basis the eigenvectors of Averaging the outcomes over all of the states in the collection associated with we obtain an approximation of the expectation value
Once we have obtained approximations and by averaging the measurement outcomes for each collection, we can approximate as
In the limit as approaches infinity, this approximation converges in probability to the true density matrix by the Law of Large Numbers, and well-known statistical bounds (such as Hoeffding's inequality) can be used to bound the probability that the approximation deviates from by varying amounts.
An important thing to recognize, however, is that the matrix obtained in this way may fail to be a density matrix. In particular, although it will always have trace equal to it may fail to be positive semidefinite. There are different known strategies for "rounding" such an approximation to a density matrix, one of them being to compute a spectral decomposition, replace any negative eigenvalues with and then renormalize (by dividing the matrix we obtain by its trace).
Qubit tomography using the tetrahedral measurement
Another option for performing qubit tomography is to measure every qubit using the tetrahedral measurement described earlier. That is,
for
Each outcome is obtained some number of times, which we will denote as for each so that The ratio of these numbers with provides an estimate of the probability associated with each possible outcome:
Finally, we shall make use of the following remarkable formula:
To establish this formula, we can use the following equation for the absolute values squared of inner products of tetrahedral states, which can be checked through direct calculations.
Now, the four matrices
are linearly independent, so it suffices to prove that the formula is true when for In particular,
and therefore
We arrive at an approximation of
This approximation will always be a Hermitian matrix having trace equal to one, but it may fail to be positive semidefinite. In this case, the approximation must be "rounded" to a density matrix, similar to the strategy involving Pauli measurements.
Was this page helpful?