Configurable 2-D Linear Feedback Shift Registers for VLSI Built-in Self-test Designs

CHIEN-IN HENRY CHEN* and YINGJIE ZHOU

Department of Electrical Engineering, Wright State University, Dayton, OH 45435

(Received 20 May 1999; In final form 13 October 1999)

Recently a multiple-sequence test generator was presented based on two-dimensional linear feedback shift registers (2-D LFSR). This generator can generate a set of pre-computed test vectors obtained by an ATPG tool for detecting random-pattern-resistant faults and particular hard-to-detect faults. In addition, it can generate better random patterns than a conventional LFSR. In this paper we describe an optimized BIST scheme which has a configurable 2-D LFSR structure. Starting from a set of stuck-at faults and a corresponding set of test vectors detecting these faults, the corresponding test pattern generator is determined automatically. A synthesis procedure of designing this test generator is presented. Experimental results show that the hardware overhead is considerably reduced compared with 2-D LFSR generators.

Keywords: Linear feedback shift registers (LFSR); built-in self-test (BIST); pseudo-random testing; deterministic testing; fault detection and fault coverage

1. INTRODUCTION

Ever-increasing VLSI circuit density has made today's chip testing more complicated and costly. Built-in self-test (BIST) has been proven to be an effective approach to achieve sufficiently high, if not complete, fault coverage without any external automatic test equipment on site. It has revolutionized the way the integrated circuit chips can be tested by (1) reducing the cost of manufacturing testing by shortening the test application time, (2) minimizing the amount of test data stored, and (3) lowering the cost of test equipment. The implementation of BIST can result in a reduction of the product development cycle and cost, and the cost of system maintenance can be significantly reduced by testing chips, boards, and the entire system virtually using BIST.

Typical BIST architecture consists of a test pattern generator and a test response analyzer. The efficiency of BIST implementation can be characterized by test length and hardware overhead for a given fault coverage requirement which is defined as the ratio of the number of faults that can be detected to the total number of faults in the assumed fault domain. It is expected that BIST...
insertion to integrated circuit chips will result in high fault coverage, small volume of test data, at-system test speed, and compatibility with the design for testability techniques. High fault coverage can be achieved only if all faults of interest are detected and their effects are retained in the test response analyzer.

There have been various BIST techniques based on pseudo-random testing [1, 9], pseudo-exhaustive testing [2, 10, 11], weighted random testing [3, 12, 13], and reseeding of LFSR [5–7]. These approaches offer different trade-offs among test data volume, test application time, hardware overhead and fault coverage. One of the most popular approaches is the pseudo-random BIST, which employs an LFSR to generate a large number of test patterns with small hardware overhead. The major limitations with this method are the long test time and low fault coverage. Several approaches have been proposed to overcome these disadvantages, most of which embed a set of pre-computed test vectors, called deterministic patterns, to reach high fault coverage within a shorter time. Such deterministic patterns can be obtained by an automatic test pattern generation (ATPG) and implemented with a ROM to store the patterns, or the traditional LFSRs with pre-designed seeds to generate the patterns.

Recently, another method was presented to realize both pseudo-random pattern generation and deterministic pattern embedding using a 2-D LFSR [8]. With the two-dimensional feedback algorithm, it can achieve higher fault coverage than the conventional LFSR due to improved randomness. However, the hardware overhead is much higher than conventional LFSR for some practical circuits. In this paper, our focus is to reduce the hardware overhead by proposing a configurable 2-D LSFR based BIST scheme.

2. 2-D LINEAR FEEDBACK SHIFTER REGISTERS

A conventional LFSR can be used to generate a large number of pseudo-random test patterns with very small area overhead that is composed of exclusive-or gates (XORs) and flip-flops (FFs). The general structure is shown in Figure 1, which can be expressed with the polynomial in Eq. (1).

\[ P(V) = 1 + C_1 V + C_2 V^2 + \cdots + C_{M-1} V^{M-1} + V^M \]  

(1)

The coefficients \( C_i (i = 1 \sim M - 1) \) are zero or one. An LFSR goes through a cyclic sequence of states and the outputs produced are also periodic. The maximum length of this period is \( 2^M - 1 \), where \( M \) is the number of stages. So the patterns generated by an LFSR are pseudo-random and the random properties depend on its initial state and its generating function \( P(V) \). Once the generating function and the initial state are fixed, the patterns produced are also determined and in one period there is no recurrent patterns. It cannot produce a sequence of deterministic ordered patterns. The fault coverage is usually low because the random-pattern-resistant faults exist in some logic structures that can be detected only by deterministic ordered patterns. A number of the LFSR based techniques [5–7] have been developed to generate deterministic ordered test patterns to detect the random-pattern-resistant faults, but the encoding and decoding procedures and the implementation of control logic are relatively complex. A 2-D LFSR structure was presented in [8], which can first generate a deterministic sequence used to detect random-pattern-resistant faults, followed by random patterns used to detect a large percentage of easily testable faults. Figure 2 shows the general representation of a 2-D LFSR based on \( N \) primitive polynomials:

\[ V_i = \sum_{j=1}^{N} \sum_{k=1}^{M} a_{ijk} V_j D^k + C_i, \quad i = 1 \sim N \]  

(2)

This circuit consists of \( N \) shift registers, each of which has \( M \) stages. \( V_i (i = 1 \sim N) \) represents an \( N \)-bit vector, and \( V_j D^k (k = 1 \sim M) \) represents the \( k \)th delay of a vector \( V_i \). \( V_i \) is generated by the feedback network given by Eq. (2). If \( a_{ijk} = 1 \), the
signal $V_jD^k$ is connected to the XOR gate to generate $V_i$. Otherwise, there is no connection. If $C_i = 0$, the output of the XOR gate is connected to the shift register directly. If $C_i = 1$, then an inverter is added to the input of the shift register for generating a complementary feedback signal. Given $L$ vectors with a length of $N$-bit,

$$V_1 = b_{11}, b_{12}, b_{13}, \ldots, b_{1L}$$

$$V_2 = b_{21}, b_{22}, b_{23}, \ldots, b_{2L}$$

$$\ldots$$

$$V_N = b_{N1}, b_{N2}, b_{N3}, \ldots, b_{NL}.$$  

To store these vectors in a 2-D LFSR with minimum stage $M$, we construct Eq. (3) and obtain the solutions including determining the minimum value of $M$ by using the branch and bound method.

$$V_i(n) = \sum_{j=1}^{N} \sum_{k=1}^{M} a_{ijk} V_j(n-k) + C_i$$  \hspace{1cm} (3)

where $V_i(n) = b_{i,n}$, $V_j(n-k) = V_j(n)^D = b_{j,(n-k)}$, $i = 1 \sim N$ and $n = 1 \sim L$. When solving the first vector $V_i(1)$, $V_j(1-k)$ represents the initial states of the 2-D LFSR. A procedure to solve Eq. (3) is described below:

{} \hspace{1cm} M = 1; \hspace{1cm} /\ast M \text{ is the index count of time frame expansions } \ast /

Set initial states of 2-D LFSR;

While (aij's for any signal $V_i$ are not found) {
    if (aij's for each signal $V_i$ are found)
        return;
    for (each unsolved $V_i$ signal) {
        Build Equations; /* Formulate Eq. (3)
        Assign a set of $a_{ijk}$ values to Eq. (3);
        if ($a_{ijk}$ values are valid)
            mark signal $V_i$ SOLVED;
        else {
            use branch & bound method to obtain a new set of $a_{ijk}$ values;
            if ($a_{ijk}$ values are valid) {
                mark signal $V_i$ SOLVED;
            }
        }
    }
}

FIGURE 2 Two dimensional LFSR structure.
if ($V_i$ is not SOLVED) {
    Build_Inverted_Equations; /* Formulate Eq. (3) and set $C_i = 1$ */
    Assign a set of $a_{ijk}$ values to Eq. (3);
    if ($a_{ijk}$ values are valid)
        mark signal $V_i$ SOLVED;
    else {
        use branch & bound method to obtain a new set of $a_{ijk}$ values;
        if ($a_{ijk}$ values are valid) {
            mark signal $V_i$ SOLVED;
        }
    }
    } 
} /* next unsolved signal */

M = M + 1;
Set initial states of 2-D LFSR;
} /* next time frame */
} /* end */

Besides embedding the deterministic sequence, the 2-D LFSR can generate better pseudo-random patterns than a conventional LFSR according to the experimental results in [8]. The longer the deterministic sequence is, the more stages of a 2-D LFSR are required, which means more hardware required. For some practical circuits, a large overhead is unacceptable. In the next section, a feasible configuration scheme based on a 2-D LFSR to reduce the overhead is proposed.

3. CONFIGURABLE 2-D LFSR BASED TEST PATTERN GENERATOR

A configurable 2-D LFSR based test pattern generator is proposed to generate an embedded deterministic sequence of test patterns followed by pseudo-random patterns. The generator mainly consists of four types of function blocks—the flip–flop array (FFA), the configuration network (CN), the multiplexers (MUXs), and the control unit (CU) as shown in Figure 3. The FFA is an $N \times M$ flip–flop array, where $N$ is the number of inputs of a circuit under test (CUT) and $M$ is the number of stages of the 2-D LFSR. In order to reduce the hardware, $M$ is usually a small number. If Eq. (3) cannot be solved with a small value of $M$, the embedded test sequence is then partitioned into subsequences for obtaining solutions for a configurable scheme. Each CN consists of XOR gates and an inverter if necessary. The MUX block selects one of the $p$ configuration networks and feeds the feedback signal to the $N$-input FFA, where $p$ is the number of configurations. The MUX block is controlled by $\lceil \log_2 p \rceil$ signals as shown in Figure 3. A relatively simple CU generates these signals for the MUX and a reset signal for the FFA. When resetting the generator, the initial states of FFA are set to $\langle 1010\ldots \rangle$ in each column of the FFA.

The synthesis procedure for a configurable 2-D LFSR is shown in Figure 4 with two main tasks: (1) find a set of deterministic ordered patterns and (2) embed these patterns in the configurable 2-D LFSR. The HITEC [4] is used as an ATPG tool to generate a set of test patterns for detecting random-pattern-resistant faults. For some circuits, however, it is hard to find test patterns to detect random-pattern-resistant faults only. Therefore, all faults including random-pattern-detectable and random-pattern-resistant faults are considered in ATPG. With the test patterns generated by the ATPG, the coefficients $a_{ijk}$’s in Eq. (3) is solved so as to generate the sequence of test patterns. The number of stages, $M$, in the 2-D LFSR reflects the hardware complexity. Given a small number of $M$, Eq. (3) is sometimes not solvable; therefore, a modified algorithm is needed. For example, partitioning the original sequence of test patterns into consecutive subsequences of test patterns and then solving Eq. (3) for each subsequence becomes a feasible solution. As the length of each subsequence decreases, Eq. (3) can be solved with an acceptable small number of stages. While the number of LFSR stages decreases by the partitioning sequence, the complexity of the configuration network increases. Hence, it is necessary to optimize the configuration network for a less cost.
The following is an example to show the effectiveness of the configurable 2-D LFSR based generator. Given a deterministic ordered sequence with 18 6-bit patterns with the initial seed (101010), number will be 5 without the configuration structure as shown in Figure 6. The initial patterns of both generators are shown in Figures 5 and 6. The first test pattern (101001) in this example is generated with the initial pattern (101010) feeding both a configurable generator and a non-configurable generator are designed. Figure 5 shows the schematics of the generator with 1-stage registers and two configuration networks, whereas the stage through the configuration network CN1, and then is shifted into the 2-D LFSR. Other patterns are generated in sequence by following the same procedures until the ninth pattern (110100) feeding

\[
\begin{array}{cccccccccccccccc}
  t_1 & t_2 & t_3 & t_4 & t_5 & t_6 & t_7 & t_8 & t_9 & t_{10} & t_{11} & t_{12} & t_{13} & t_{14} & t_{15} & t_{16} & t_{17} & t_{18} \\
 1: & 1 & 0 & 1 & 0 & 0 & 0 & 0 & 1 & 1 & 1 & 0 & 0 & 1 & 0 & 1 & 1 & 1 & 0 \\
 2: & 0 & 1 & 1 & 0 & 1 & 0 & 1 & 1 & 1 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 1 \\
 3: & 1 & 1 & 1 & 1 & 0 & 0 & 0 & 1 & 0 & 0 & 1 & 1 & 0 & 0 & 0 & 1 & 0 & 0 \\
 4: & 0 & 1 & 1 & 1 & 1 & 0 & 0 & 0 & 1 & 0 & 0 & 1 & 1 & 0 & 0 & 0 & 1 & 0 \\
 5: & 0 & 0 & 1 & 1 & 1 & 0 & 1 & 1 & 0 & 1 & 0 & 1 & 1 & 0 & 0 & 1 & 0 & 0 \\
 6: & 1 & 0 & 1 & 1 & 1 & 1 & 0 & 0 & 0 & 1 & 1 & 1 & 1 & 0 & 1 & 0 & 1 & 0 \\
\end{array}
\]
generated. In this configurable generator, the CN1 generates the first 9 patterns, \(101001\), \(011100\), \(111111\), \(001111\), \(010111\), \(000001\), \(000000\), \(110100\) and \(110101\). The tenth pattern is generated with the ninth pattern \(110100\) feeding through the CN2. The CU with a modulo-9 counter controls this switch. With the optimization on the feedback network of Figure 5, an optimized configurable 2-D LFSR is shown in Figure 7.
FIGURE 5 A configurable 2-D LFSR based test pattern generator.

FIGURE 6 A 2-D LFSR test pattern generator.
4. EXPERIMENTAL RESULTS

Five synthesized circuits are used as benchmark circuits to evaluate the configuration structure in the 2-D LFSR scheme. The characteristics of the five circuits are summarized in Table I. Am2910 is a 12-bit microprogram sequencer, which has a stack of depth 5, a stack pointer, a microprogram counter, and a register counter (down counter). A 4-to-1 multiplexer selects the next microprogram address from either the input data bus, the stack, the register counter, or the microprogram counter. Mult16 is a 16-bit 2's complement multiplier, which uses a shift-and-add algorithm. The control unit has been implemented using 3-bit and 4-bit counters. There is one start instruction. Div16 is a 16-bit divider that uses repeated subtraction to perform division. A zero-detect unit checks for the divide-overflow condition and flags the termination of the repeated subtraction loop. There is one control instruction: Start. Pcont2 is an 8-bit parallel controller used in DSP applications. This circuit is composed of 8 adders, 2 fixed-point multipliers, and 3 MUXs—all combinational logic. Three registers serve as delay elements. A global reset line resets all the registers, and new data is input at every clock cycle. There is no control unit. Piir8 is an 8-point infinite impulse

<table>
<thead>
<tr>
<th>Circuits</th>
<th>Seq. depth</th>
<th>FFs</th>
<th>PIs</th>
<th>POs</th>
</tr>
</thead>
<tbody>
<tr>
<td>am2910</td>
<td>4</td>
<td>87</td>
<td>20</td>
<td>16</td>
</tr>
<tr>
<td>mult16</td>
<td>9</td>
<td>55</td>
<td>18</td>
<td>33</td>
</tr>
<tr>
<td>div16</td>
<td>19</td>
<td>50</td>
<td>33</td>
<td>34</td>
</tr>
<tr>
<td>pcont2</td>
<td>3</td>
<td>24</td>
<td>9</td>
<td>8</td>
</tr>
<tr>
<td>piir8</td>
<td>5</td>
<td>56</td>
<td>9</td>
<td>8</td>
</tr>
</tbody>
</table>
response filter for DSP applications. It has a very regular structure with 7 adders, 8 fixed-point multipliers, 7 8-bit registers, and 7 2-to-1 MUXs. The data input and output buses are 8 bits wide. There is no control unit. A global reset line is the only control signal. In Table II, the configuration and the original non-configuration results of the five benchmark circuits are compared. The first column is the number of embedded deterministic patterns detecting the random-resistant faults for each circuit. The second and third columns give the stage number of shift registers and the total number of flip-flops for the non-configurable test generator. The number of configurations, the stage number of shift registers and the total number of flip-flops for the configurable 2-D LFSR based generator are given in the fourth, fifth and sixth columns. For the am2910 and mul16 the flip-flops are reduced significantly to 15% and 14% respectively. Because the number of embedded patterns for div16, pcont2 and piir8 is small, the synthesis procedure results in no reconfiguration. Compared with the other four circuits, div16 has the largest FFA due to its large number of inputs. The minimum stage number of $M$ can be obtained by increasing the loop times in the branch and bound algorithm while solving Eq. (3) at a cost of CPU time. Table III shows the comparison of the fault. The test patterns generated by HITEC can be either stored in a ROM or scanned in. The results of the conventional LFSR generator are listed in the third column. Column four gives the results from a combination of the conventional LFSR and the additional deterministic ordered patterns. The data in the fifth and last columns are obtained by the 2-D LFSR and the configurable 2-D LFSR based generators respectively. The average number of equivalent faults of five circuits is 9,441 where 3,955 faults are detected by “HITEC” and 6,054 faults are detected by “Random Patterns”. Comparing “LFSR”, “LFSR+Add”, and “2-D LFSR”, the experimental results show that the average detected faults by “2-D LFSR” is 5.08% higher than that of “LFSR+Add” and 8.65% higher than that of “LFSR”. Note that the average number of faults detected by “2-D LFSR” is 5,952, which is only 1.76% less than that of “Random Patterns”. The average number of faults detected by “Configurable 2-D LFSR” is 5,986, which covers about the

<table>
<thead>
<tr>
<th>Circuits</th>
<th>Patterns</th>
<th>Stages</th>
<th>Flip-flops</th>
<th>Config.</th>
<th>Stages</th>
<th>Flip-flops</th>
</tr>
</thead>
<tbody>
<tr>
<td>2-D LFSR</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>am2910</td>
<td>30</td>
<td>13</td>
<td>260</td>
<td>2</td>
<td>2</td>
<td>40</td>
</tr>
<tr>
<td>mul16</td>
<td>72</td>
<td>14</td>
<td>252</td>
<td>4</td>
<td>2</td>
<td>36</td>
</tr>
<tr>
<td>div16</td>
<td>21</td>
<td>10</td>
<td>330</td>
<td>1</td>
<td>2</td>
<td>66</td>
</tr>
<tr>
<td>pcont2</td>
<td>7</td>
<td>2</td>
<td>18</td>
<td>1</td>
<td>2</td>
<td>18</td>
</tr>
<tr>
<td>piir8</td>
<td>19</td>
<td>3</td>
<td>27</td>
<td>1</td>
<td>3</td>
<td>27</td>
</tr>
<tr>
<td>Total</td>
<td>149</td>
<td>–</td>
<td>887</td>
<td>–</td>
<td>–</td>
<td>187</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Circuits</th>
<th>Total faults</th>
<th>HITEC</th>
<th>Random patterns</th>
<th>LFSR</th>
<th>LFSR + Add</th>
<th>2-D LFSR</th>
<th>Configurable 2-D LFSR</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>Det</td>
<td>Vec</td>
<td>Det</td>
<td>Vec</td>
<td>Det</td>
<td>Vec</td>
<td>Det</td>
</tr>
<tr>
<td>am2910</td>
<td>2391</td>
<td>2142</td>
<td>726</td>
<td>2155</td>
<td>2126</td>
<td>2166</td>
<td>21166</td>
</tr>
<tr>
<td>mul16</td>
<td>1708</td>
<td>1568</td>
<td>89</td>
<td>1475</td>
<td>665</td>
<td>1395</td>
<td>1395</td>
</tr>
<tr>
<td>div16</td>
<td>2147</td>
<td>1676</td>
<td>190</td>
<td>1718</td>
<td>1679</td>
<td>1702</td>
<td>1702</td>
</tr>
<tr>
<td>pcont2</td>
<td>11272</td>
<td>3147</td>
<td>7</td>
<td>6815</td>
<td>6727</td>
<td>6727</td>
<td>6727</td>
</tr>
<tr>
<td>piir8</td>
<td>29689</td>
<td>11234</td>
<td>19</td>
<td>18106</td>
<td>16195</td>
<td>16331</td>
<td>16331</td>
</tr>
<tr>
<td>Avg.</td>
<td>9441</td>
<td>3955</td>
<td>209</td>
<td>6054</td>
<td>5478</td>
<td>5664</td>
<td>5664</td>
</tr>
</tbody>
</table>
same number of detected faults as the 2-D LFSR generator, but the overall overhead of the FFA has been reduced by 79%.

5. CONCLUSION

A BIST scheme based on a configurable 2-D LFSR and a procedure to design a test pattern generator have been presented. Without storage of test patterns and seeds, the 2-D LFSR test pattern generator can generate a sequence of deterministic test patterns as well as pseudo-random patterns. The hardware overhead of this BIST scheme decreased considerably through configuration. The experiment result shows that compared with the non-configurable 2-D LFSR, the number of flip-flops is reduced by 79% for five synthesized circuits. The average number of faults detected by the configurable 2-D LFSR is 9.27% higher than that of the conventional LFSR and 0.57% higher than that of the non-configurable 2-D LFSR.

In order to improve the fault coverage of the configurable 2-D test pattern generator, a set of embedded deterministic patterns with high fault coverage is needed. Logic optimization tools are also necessary to further reduce the hardware overhead of the feedback networks.

References


Authors’ Biographies

Chien-In Henry Chen received his B.S. degree from the National Taiwan University, Taiwan, in 1981, his M.S. degree from the University of Iowa, Iowa City, in 1986, and his Ph.D. degree from the University of Minnesota, Minneapolis, in 1989, all in electrical engineering. Since joining Wright State University in 1989 he has worked primarily in computer-aided design, timing and testing of very large scale integrated (VLSI) circuits, where he is currently an Associate Professor. He has written over 50 publications in professional journal and conference proceedings and is a technical reviewer for various journals and conferences. He was a technical committee member of the 1995 IEEE International ASIC Conference and Exhibit and was a plenary speaker in the 6th VLSI Design/CAD Symposium.

Yingjie Zhou received his B.S. degree in Electrical Engineering from Tsinghua University, Beijing, China. He was an IC design engineer of Beijing Intelligence Electronic Company, Beijing, China, 1995–98 and is currently a Ph.D. student at Wright State University, Dayton, Ohio. His research interests include VLSI circuit design, design for testability, and built-in self-test.