Tevatron Ionization Profile Monitor (IPM) Buffer Module
8 February 2010
Mark Bowden
Richard Kwarciany
Introduction
The IPM Buffer module receives data from the IPM QIE front end modules over fast serial optical links. The data is buffered and then read out through a standard 3.3 Volt PCI interface by a commercial PC. The difference in maximum input and output rates is approximately 10:1, so at least 10 seconds of data readout time are required for each second of data acquisition, or the data must be compressed/filtered. Additional time will be required for analysis, depending on the speed of the processor.
IPM Buffer Block Diagram
Implementation
The IPM system will use 16 optical links to supply data. Standard optical components are available in 12 channel increments, meaning that two IPM Buffer modules with one optical receiver each will be used.
IPM Buffer PCI Card
PCI Interface
The IPM Buffer uses the PLX PCI9656 interface. The PCI9656 runs at 64 bits/ 66MHz on the PCI side but is limited to 32 bit/ 66 MHz operation on the local bus. The maximum transfer rate is approximately 20 MBytes/sec, although it may be possible to run faster using two cards and transferring bursts of data in a “ping-pong” fashion.
PLX PCI9656
Advantages of the standalone PCI interface are;
1) it is active at power-up, and can be used to program the FPGA through PCI.
2) it has been used in previous Fermilab designs, so the development software exists and has been exercised.
PCI registers will be provided for configuration of the trigger, readout of the data, and diagnostic functions. In diagnostic mode, all of the main memory is accessible to the PCI host.
Memory
The IPM Buffer will hold two DDR SODIMM modules for a total of 0.5 GBytes of buffer memory. This represents approximately 0.4 seconds of continuous data.
The memory is arranged in two banks, with a total bandwidth of 2 GBytes/sec. This should be sufficient to handle the peak input data rate (1.5 GBytes/sec for 8 channels).
FPGA
The FPGA chosen for the IPM buffer is the Xilinx XC2VP20. This device has substantial internal resources (20,000+ registers and 1.5 Mbits of memory), allowing formatting and filtering to be done on-board the Buffer module before data is transferred to the PC. Each input link is decoupled by a 64 bit wide FIFO and the data is transferred to external memory in blocks of 32 64-bit words (2 KBytes).
Deserializers
The deserializers are included in the FPGA and are capable of accepting data rates in the range of 600-3000 Mbps. They require a local reference clock with a frequency very near (+/- 100 ppm) the intended data rate for initial lock. Following the initial frequency lock, the deserializers will relock to the actual frequency of the incoming data.
The local reference clock is provided by a crystal oscillator.
For reliable link operation, the data transmitter should run at a rate slightly faster than the required data transfer rate. This will cause insertion of an occasional idle character in the data stream, which helps to maintain word alignment in the receiver.
On command, the transmitter should also have the capability of sending a specific sync word (8B/10B comma character) for a period of at least 20 usec to establish receiver word synchronization.
All transmitters connected to the same IPM Buffer should operate at the same data rate (+/- 100 ppm).
Optical Links
The IPM data sources are assumed to use multimode (850 nm) optics. The transmitter connector type is not specified, since fanout cables are available in most common types.
Optical Receiver(s)
The receive end of the optical link uses the MTP/MPO style 12 channel connector. The receiver(s) areInfineon V23832R devices (second source Molex 86991). These receivers use a BGA array connector and are removable.
MTP (12 channel) parallel fiber
At the source end of the optical cable the 12 parallel fibers are split into individual fibers using either a short fanout adapter cable, or a fiber cassette.
Parallel fiber fanout cable.
Functional Description
Figure 1
Fiber Reciever
Referring to Figure 1, the Fiber Receivers are actually one discrete component which convert the optical input to aCML differential signal. The Infineon V23832R device used is a 12 channel device. Eight of the 12 channels are used in this implementation. Channels 8 through 11 are not connected.
SERDES
Each CML input channel is deserialized by a SERDES device internal to the FPGA. The SERDES produces a 16-bit wide parallel data stream.
FIFO
A two-stage FIFO memory on each input channel allows decoupling of the link clocks from the FPGA’s clock, and also formatting of the incoming data into 64-bit wide data frames. The FIFOs are large enough to allow the data from all 8serial links to be synchronized to each other. Since data on the links is organized into frames, synchronization of the eight links with each other is accomplished by comparing the frame header information on the links.
Trigger / Data Filter
Configured through registers in the PCI interface, the Trigger / Data Filter block allows for selective acquisition of data from the front end electronics. Up to 318 samples per turn can be stored by populating the sample masks through PCI. For each bit set true in the mask, data at that sample will be forwarded to the Memory Controller block for storage in memory. Individual channels are enabled or disabled in the Channel Mask Register. All data is stored if Data Filtering is disabled.
The Trigger / Data Filter block also controls triggering of data acquisition. The trigger logic is configured through PCI registers, and allows for saving a preset number of turns after an injection marker, or after a software data acquisition start command is issued. To acquire a preset number of turns, the number of turns is set in the N Turns register, then the N Turn Enable bit is set in the CSR register.
Counters
The Counters block contains the Proton sample counter, the Pbar sample counter, and the Turn counter. The Proton sample counter is a modulo-318 counter that is reset by the proton marker bit in the data header, and incremented on each sample. The Pbar sample counter is similar, but is reset by the pbar marker bit. The Turn counter is a 16-bit counter that is reset by the injection marker bit, and increments when the proton counter wraps to zero. The values of each of the three counters is added to the header information and stored with the data during data acquisition.
Memory Controller
The Memory Controller block provides for initialization and control of the DDR SDRAM memory that is external to the FPGA. The memory controller itself is internal to the FPGA. Write and Read requests are entertained from either the PCI interface, or from the Trigger / Data Filter blocks, depending on which mode the IPM Buffer is in. The mode is controlled via a bit in the CSR register. Two modes are defined. Acquisition Mode, and Readout Mode. When in Acquisition Mode, the Trigger / Data Filter block has exclusive access to the DDR SDRAM, and read attempts from PCI return DEADBEEF. Write attempts are ignored. When in Readout Mode, the PCI interface has exclusive access to the DDR SDRAM, and data from the input links is not stored.
Main Memory
The main memory is discrete from the FPGA, and consists of two 256 MByte DDR SDRAM SODIMM modules. Each module is treated as a separate bank by the Memory Controller block, but is accessed as one large bank via PCI. Main memory can be read or written via PCI when the IPM Buffer is in Readout Mode. Placing the module into Acquisition Mode allows data from the input links to be stored in Main Memory.
PCI Interface
The PCI interface is a dedicated PCI interface device that is discrete from the FPGA chip on the board. It supports 64-bit, 66 MHz PCI.
Diagnostic Features
Loopback Data Generator.
Each input serial link is equipped with a pattern generator that can be configured to transmit a known data pattern into the serial input. When enabled data frames of the required format are generated with an incrementing pattern to allow for confidence testing of the high speed serial links. The data generator transmits the following pattern:
Force Error Registers
To facilitate error reporting software development, the IPM Buffer module can be programmed to generate false error conditions. These error conditions are reported in the Link Error registers. Each of the individual links can be programmed to generate one or all of the 10 link error conditions by setting the mask bits in the Link Force Error Register.
Front End Diagnostic Mode
To facilitate debugging of front end boards, the IPM Buffer can be put into Front End Diagnostic Mode. When in this mode, raw footer data from the lowest four channel front end boards is copied into the IPM Buffer memory frame in place of the data for the highest four channel front end boards. Front End Diagnostic Mode is enabled by setting the Front End Diagnostic Mode bit in CSR0.
Input Data Optical Connector
The Input Data Optical Connector accepts a 12-fiber ribbon with an MTP connector. Fibers are numbered 0 – 11 with fiber 0 being defined as the bottom fiber when facing the IPM Buffer card from the rear of the host PC. Only fibers 0 through 7 are used in the IPM Buffer Module. Fibers 8 through 11 are not used.
The FONetworks 12-fiber breakout adapter breaks the fiber ribbon out into 12 individual fibers with LC type connectors. The individual fibers are color coded as follows.
Fiber Color / Fiber NumberDark Blue / Fiber 0
Orange / Fiber 1
Green / Fiber 2
Brown / Fiber 3
Gray / Fiber 4
White / Fiber 5
Red / Fiber 6
Black / Fiber 7
Yellow / Fiber 8
Purple / Fiber 9
Pink / Fiber 10
Light Blue / Fiber 11
Input Data Format
Link data received from the font-end boards adheres to the following encoding scheme:
IPM Buffer accepts data in the format above only. Data frames consist of seven contiguous 32-bit data words. Each frame contains data for three time slices, identified by the capID bits. Data frames may have fill characters between them, but a frame error will be set if fill a character occurs within a frame. Once each link’s SERDES is locked, at least one fill character is required for IPM Buffer to frame synchronize, but once synchronized, a string of data frames with no fill character separating them is allowed. It is recommended that a fill character be added between data frames occasionally to allow IPM Buffer to check for synchronization errors. A counter is implemented to flag an error if a fill character is not received for more than approximately 100 contiguous data words.
IPM Buffer deserializers accept any 32-bit k character sequence as a fill character, but for deserializer clock correction, the clock correction sequence should be transmitted approximately every 5000 bytes. This allows the deserializer to add or remove clock correction characters as necessary to compensate for small clock rate differences between the transmitter and the receiver. The defined clock correction sequence for IPM Buffer is:
k28.5 d16.2 k28.5 d5.6 .
Data in the frame is defined as follows:
QIExFront End QIE pseudo exponential data.
CAPIDTwo bit CAPID number from QIE chip.
ModeTwo bit Mode identifier from the QIE chip. Defined modes are as
follows:
00Calibration
01Normal
10Debug/Testing
11RESET Marker
Timing BitsFour front end timing bits defined as follows:
31Proton Injection
30PBar Injection
29Proton Marker
28PBar Marker
Error BitsThe following front end error bits are defined:
30– 25 Unused.
24PLL Lock Fail in Timeslice 2
23PLL Lock Fail in Timeslice 1
22PLL Lock Fail in Timeslice 0
21CAPID error in Timeslice 2
20CAPID error in Timeslice 1
19CAPID error in Timeslice 0
18FIFO Full in Timeslice 2
17FIFO Full in Timeslice 1
16 FIFO Full in Timeslice 0
Counter Data16-bit counter increments on each timeslice. This results in a
count value incrementing by three for each data frame.
Memory Data Format
Data from the eight input links is combined and stored in main memory in 80 byte frames. Each frame contains data from one timeslice.
Memory Data Frame Format
63 0Header Low
Header High
Link 0 Data
Link 1 Data
Link 2 Data
Link 3 Data
Link 4 Data
Link 5 Data
Link 6 Data
Link 7 Data
Header Low
63 – 56 / 55 – 54 / 53 – 52 / 51 / 50 / 49 - 41 / 40 - 32PLL Lock Error / QIE Mode / Cap ID / 0 / Counter Sync Error / Proton Counter / PBar Counter
31 – 24 / 23 – 16 / 15 – 0
FIFO Full Error / Cap ID Error / Turn Counter
Header High
63 – 56 / 55 – 48 / 47 – 40 / 39 – 32 / 31 – 24 / 23 – 16 / 15 – 8 / 7 – 0Counter Sync Error / Frame Error / QIE Mode Sync Error / Cap ID Sync Error / Proton Injection Error / PBar Injection Error / Proton Marker Error / PBar Marker Error
Header error bits in eight bit blocks contain one error bit for each link, with the lowest significant bit corresponding to the error occurring in link 0, the next lowest in link 1, etc.
Link x Data
QIE Data can be stored in Raw format (exactly as received from the front end board), or converted to an integer value using the conversion lookup table. The conversion lookup table is loaded and enabled through PCI (see: PCI Interface), and when enabled, causes the IPM Buffer to store an eight bit integer value into memory, instead of seven bit pseudo exponential value for each QIE channel.
Link x Data (RAW)
63 – 60 / 59 – 58 / 57 – 56 / 55 - 47 / 48 - 42 / 41 - 35 / 34 - 32Timing / QIE Mode / Cap ID / QIE 7 / QIE 6 / QIE 5 / QIE 4 (high)
31 – 28 / 27 – 21 / 20 – 14 / 13 – 7 / 6 – 0
QIE 4 (high) / QIE 3 / QIE 2 / QIE 1 / QIE 0
Link x Data (Integer Conversion Enabled)
63 – 56 / 55 – 48 / 47 – 40 / 39 – 32 / 31 – 24 / 23 – 16 / 15 – 8 / 7 – 0QIE 7 / QIE 6 / QIE 5 / QIE 4 / QIE 3 / QIE 2 / QIE 1 / QIE 0
Memory Data Frame Format when in Front End Diagnostic Mode
63 0Header Low
Header High
Link 0 Data
Link 1 Data
Link 2 Data
Link 3 Data
Link 0 RAW Front End Footer Data
Link 1 RAW Front End Footer Data
Link 2 RAW Front End Footer Data
Link 3 RAW Front End Footer Data
Link x RAW Front End Footer Data
63 – 56 / 55 – 48 / 47 – 40 / 39 – 32Frame Error, Hard Trigger and FE Init / Time Slice 2 Timing Mode and CapID / Time Slice 1 Timing Mode and CapID / Time Slice 0 Timing Mode and CapID
31 – 0
RAW Front End Footer
Frame Error, Hard Trigger, and Front End Init
63 / 62 / 61 / 60 – 56Frame Error / Hard Trigger / Front End Init / Unused (0x00000)
Time Slice x Timing Mode and CapID
7 / 6 / 5 / 4 / 3 – 2 / 1 – 0Proton Inj / PBar Inj / Proton Marker / PBar Marker / Mode / CapID
Interboard Synchronization
The IPM Buffer provides a mechanism for ensuring synchronization between two boards in the same host system. Connector J2 (top of pc board) provides for this function. A 20-pin ribbon cable jumper must be connected between the J2 connectors of the two boards. Once connected, one of the boards is designated the slave in software by setting the Interboard Slave Enable bit in the CSR register. The other board is designated the master by setting the Interboard Master Enable bit in the CSR register. Once properly configured and connected, the master IPM Buffer transmits the Turn Counter value of the event to trigger on to the slave. Both boards then acquire at the specified Turn Counter value. The master also passes the acquisition enable signal to the slave.
PCI Interface
CAUTION: The IPM Buffer board supports 3.3-Volt PCI ONLY. The board will physically fit into a 5-Volt PCI slot, but powering a system with an IPM Buffer module in a 5-Volt slot WOULD BE BAD. Fatal damage to the IPM Buffer, the system motherboard, power supply, or all of the above will result if attempted.
Note that 3.3 Volt PCI slots can be identified by the location of the key in the slot. 3.3 Volt PCI slots have the key near the back panel of the system, whereas 5 Volt PCI slots have the key nearest the end of the connector that is farthest from the back panel of the system. See figure above.
The following PCI registers are defined:
PCI Configuration Space:
00h / Device ID / Vendor ID (9656 10B5)04h / Status / Command
08h / Class Code / Rev ID(068000AD)
10h / Base Address of Memory access to PLX registers (PCIBAR0)
14h / Reserved (PCIBAR1)
18h / Base Address of Memory access to Readout Data Window (PCIBAR2 – Memory Space 0)
1Ch / Base Address of Memoryaccess to IPM CSR register block (PCIBAR3 – Memory Space 1)
20h / Reserved
24h / Reserved
28h / Reserved
2Ch / PCI Subsystem ID / PCI Subsystem Vendor ID (24400003)
30h – 53h / Reserved (do not write)
Memory Space 1 (Memory Space 1 Base = contents of PCIBAR3):
Mem1 Base + 00h / Control-Status registerMem1 Base + 04h / Memory window page register
Mem1 Base + 08h / Master Link Select Mask / Trigger Mask
Mem1 Base + 0Ch / Event Data Address Counter register
Mem1 Base + 10h / SERDES Control 0
Mem1 Base + 14h / SERDES Control 1
Mem1 Base + 18h / Link Status Register
Mem1 Base + 1Ch / N Turn Count
Mem1 Base + 20h / Link Error Register 0
Mem1 Base + 24h / Link Error Register 1
Mem1 Base + 28h / Link Error Register 2
Mem1 Base + 2Ch / Link Error Register 3
Mem1 Base + 30h / Link Diagnostic Register
Mem1 Base + 34h –
Mem1 Base + 3Ch / Reserved (DEADBEEF)
Mem1 Base + 40h / Link Error Register 4
Mem1 Base + 44h –
Mem1 Base + B0h / Reserved (DEADBEEF)
Mem1 Base + B4h / Acquisition Timer Register
Mem1 Base + B8h –
Mem1 Base + CCh / Reserved (DEADBEEF)
Mem1 Base + D0h / State Machine Low
Mem1 Base + D4h / State Machine Mid
Mem1 Base + D8h / State Machine High
Mem1 Base + DCh –
Mem1 Base + ECh / Reserved (DEADBEEF)
Mem1 Base + F0h / Firmware ID code (date in mmddyyyy format)
Mem1 Base + F4h / FPGA Firmware JTAG TDO bit
Mem1 Base + F8h / IPM Buffer FIFO Full Error Flags
Mem1 Base + FCh / Reserved (DEADBEEF)
Mem1 Base + 100h / Proton Mask Register 0 (proton mask bits 31 to 0)
Mem1 Base + 104h / Proton Mask Register 1 (proton mask bits 63 to 32)
Mem1 Base + 108h / Proton Mask Register 2 (proton mask bits 95 to 64)
Mem1 Base + 10Ch / Proton Mask Register 3 (proton mask bits 127 to 96)
Mem1 Base + 110h / Proton Mask Register 4 (proton mask bits 159 to 128)
Mem1 Base + 114h / Proton Mask Register 5 (proton mask bits 191 to 160)
Mem1 Base + 118h / Proton Mask Register 6 (proton mask bits 223 to 192)
Mem1 Base + 11Ch / Proton Mask Register 7 (proton mask bits 255 to 224)
Mem1 Base + 120h / Proton Mask Register 8 (proton mask bits 287 to 256)
Mem1 Base + 124h / Proton Mask Register 9 (proton mask bits 317 to 288)
Mem1 Base + 128h – 13Ch / Reserved
Mem1 Base + 140h / PBar Mask Register 0 (PBar mask bits 31 to 0)
Mem1 Base + 144h / PBar Mask Register 1 (PBar mask bits 63 to 32)
Mem1 Base + 148h / PBar Mask Register 2 (PBar mask bits 95 to 64)
Mem1 Base + 14Ch / PBar Mask Register 3 (PBar mask bits 127 to 96)
Mem1 Base + 150h / PBar Mask Register 4 (PBar mask bits 159 to 128)
Mem1 Base + 154h / PBar Mask Register 5 (PBar mask bits 191 to 160)
Mem1 Base + 158h / PBar Mask Register 6 (PBar mask bits 223 to 192)
Mem1 Base + 15Ch / PBar Mask Register 7 (PBar mask bits 255 to 224)
Mem1 Base + 160h / PBar Mask Register 8 (PBar mask bits 287 to 256)
Mem1 Base + 164h / PBar Mask Register 9 (PBar mask bits 317 to 288)
Mem1 Base + 168h – 1FCh / Reserved
Mem1 Base + 200h – Mem1 Base + 3FFh / QIE Conversion Lookup Table
Status / CSR Register (offset x00)