# CMOS Image Sensor Design and Image Processing Algorithm Implementation for Total Hip Arthroplasty Surgery

Syed Mudassir Hussain, Fasih Ud Din Farrukh , Shaojie Su, Zhihua Wang, *Fellow, IEEE*, and Hong Chen , *Senior Member, IEEE* 

Abstract—As the average age of the population goes higher, the people undergo hip arthroplasty surgery to replace their stiff and painful damaged hip joints. To reduce the risk factor after the surgery we developed a visual aided system in previous works. In order to solve the integration problems of commercial camera with other sensors into the femoral head and to minimize the area and power consumption, in this paper we propose a CMOS image sensor of resolution 200 imes 200 specially designed for the application in which each individual pixel measures around 15  $\mu$ m imes 15  $\mu$ m in size and the image sensor chip size measures about 3.5 mm  $\times$ 3.5 mm. The proposed sensor is simulated with the input current variations from 2 pA to 100 pA for the individual pixels and the corresponding measurements for each pixel range from 2 mV to 855 mV. Besides, we put forward a new method of pattern detection and recognition in the blood-covered situation, which provides an accurate segmentation of patterns from the blood. All the detected patterns are recognized by generating its right 9-bit binary ID required for the pose estimation calculation. Furthermore, to reduce system power consumption, we implement algorithms on FPGA to process the image data pixel by pixel and transmit it directly to the computer for post-processing. Experimental results show that the pattern detection rate goes as high as 99%, which is 5% better in accuracy compared to the top hat algorithm. The power consumption of the system is 213 mW, which is a 70% decrease compared to our previous work.

*Index Terms*—FPGA, image processing, patterns detection, total hip arthroplasty.

## I. INTRODUCTION

ITH an increase in the average age of the population, a variety of diseases like arthritis and many other human activities make the hip joint stiff and extremely painful. The ultimate solution to alleviate pain and regain mobility is to replace

Color versions of one or more of the figures in this article are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TBCAS.2019.2946867

Example 2 Cartilage (a) Healthy Joint (b) Damaged Joint

Fig. 1. Hip joint.

the damaged hip joint [1]. Fig. 1 depicts a healthy and damaged joint, smooth rubbery cartilage covers the end of bones, letting the bones glide against each other easily. Whereas the damaged cartilage usually by arthritis and bone rub against each other makes the joint painful and stiff. According to the annual report published by the American Academy of Orthopedic Surgeons (AAOS) and American Joint Replacement Registry (AJRR) on hip and knee arthroplasty from the year 2012–2017. More than 443,219 various hip procedures including hip resurfacing, partial hip replacement or Hemiarthroplasty, hip revision and total hip replacement are performed in the United States, and the number is forecasted to increase in the next two decades. Total hip replacement (THR) procedure has been reported to be 79.2% of all the hip arthroplasty procedures that are quite significant [2].

However, the total hip replacement arthroplasty surgeries have been reported a failure rate of about 10% which may cause intense complications such as dislocation, prosthetic impingement, infections, intraoperative fracture, leg length discrepancy among other complications of wear and loosening [3]. Common failures of THR are prosthetic impingement and dislocation whose occurrence is estimated between 0.5% to 11% [4], [5]. During THR surgery, placement of prosthetic components into the safe zone is a key step [6], if not placed properly it may lead to dislocation of the components and cause a limited hip range of motion (ROM) [7]. Therefore, surgeons strongly need a visualizing system in order to place prosthetic components into

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

Manuscript received July 16, 2019; revised August 26, 2019 and September 29, 2019; accepted October 1, 2019. Date of publication October 11, 2019; date of current version December 31, 2019. This work was supported in part by the National Natural Science Foundation of China under Grant 61674090, in part by the National Key Research Plan under Grant 2018AAA0103102, and in part by Beijing Engineering Research Center under Grant BG0149. This paper was recommended by Associate Editor Dr M. Atef. (*Corresponding author: Hong Chen.*)

The authors are with the Institute of Microelectronics, Tsinghua University, Beijing National Research Center for Information Science and Technology, and Beijing Engineering Center of Technology and Research on Wireless Medical and Health System, Beijing 10084, China (e-mail: mdxe16@mails.tsinghua. edu.cn; fa-s17@mails.tsinghua.edu.cn; susj12@mails.tsinghua.edu.cn; zhihua @tsinghua.edu.cn; hongchen@tsinghua.edu.cn).

the safe zone during hip joint arthroplasty. Prosthetic component placement into the safe zone has been presented and discussed in [6], three-dimensional (3D) models designed by using computed tomography (CT) images helped surgeons to present motion restriction of the hip joint [8]. However, the preoperative system decision may need to be modified because of the possibility of uncertainty to occur during the surgery. Although the CT based navigation system has provided a significant development in the implantation of prosthetic components into the safe zone [9], but it cannot be used as a standard operating technique to position the prostheses, because it may cost an additional operation time, more money and furthermore the patients are exposed to the harmful radiations [9].

Orthopilot surgical robot has provided a CT free navigation system for more precise, accurate and reliable prostheses implants, where the patients do not need to undergo preoperative scanning requirements [10]. Nevertheless, it needs an extra marker affixing unit to the bony landmarks, hence making the calibration, data processing and collection more complex, which may cause extra damage and infections to the patients as well [11]. Moreover, it is quite expensive. In our work [12], we have designed a visually aided wireless monitoring system to estimate the relative pose between the femoral head prosthesis and acetabular cup prosthesis during the hip joint arthroplasty surgery. This CT free smart trial helps the surgeons to visualize the positioning of prosthetic components into the safe zone without creating bony anatomical landmarks during surgery; moreover, it is also cost-effective. Power consumption by the system is only 46.8 mW but it is hard to mount enough contact sensors on the femoral head prosthesis providing a limited range of motion inside the acetabular cup thereby increasing the system mean error to 4.40 degrees, hence making the system far from its practical use.

Later, to deal with the problems in mounting enough contact sensors onto the femoral head prosthesis. We proposed a design of monocular vision and IMU-based visual aided system [13], the system consists of a miniature camera inside the femoral head prosthesis from Omni Vision (OV7660) to capture images inside the acetabular cup, and an inertial measurement unit IMU for more precise pose estimation. The commercial camera module used in our previous works has a package dimension of 4.2 mm  $\times$  3.9 mm with an optical size 1/5 inch and active power consumption by the module is 40 mW without loading. Using the camera inside the femoral head to take images from the acetabular cup and then transmitting the data through a Wi-Fi module to the host computer to perform processing off-chip takes a lot of execution time and power. The power consumption by a monocular vision-based system is about 980 mW and with IMU the system consumes 768 mW, which is relatively high. This system can work for as long as 15 minutes. Furthermore, uncertain light conditions inside the acetabular cup and the blood interference during surgery may lead to the failure of a vision-based system where pose estimation is based on accurate detection of the features and coordinates of the customized patterns. Therefore, we need to design a system on chip to capture and process the image data on-chip inside the acetabular cup rather than sending the data off-chip for processing to



Fig. 2. Vision based smart trial.

minimize the modules integration problem with the least power consumption to prolong system operation time.

The system architecture of our proposed smart trail is illustrated in Fig. 2, where our application-specific CMOS image sensor replaces the commercial camera from Omni Vision. The system comprises an application-specific CMOS image sensor of resolution  $200 \times 200$  inside a femoral head prosthesis with other sensor circuits to capture the images from the acetabular cup. Whereas the acetabular cup has customized reference patterns on its inner liner. The captured images are further processed from uneven illumination and blood interference conditions on the hardware interface for on-chip image processing. The pose estimation calculation between the femoral head prosthesis and the acetabular cup is performed off-chip using software developed with a visual studio on the computer.

In this paper, we first present the design and simulation of CMOS image sensor for our smart trial. The proposed image sensor replaces a bulky camera from the femoral head, which consumes most of the system power. Secondly, during hip arthroplasty surgery there is a possibility of blood interference on the sight of the camera, which may lead to the failure in estimating the exact pose of femoral head prosthesis inside the acetabulum. An efficient image-processing algorithm is developed to detect the features and coordinates of each pattern from the blood-covered images. The method is compared with the top-hat algorithm [13]. Finally, the algorithms are implemented on FPGA for the ASIC development to provide an on-chip image processing. Performance and power consumption by the methods are compared.

The rest of the paper is organized as follows: The CMOS imaging system architecture is presented in Section II and image processing algorithm for pattern detection and recognition are discussed in Section III, the implementation of image processing and its hardware demonstration results are explained in Section IV. FPGA experimental results and discussions are given in Section V. Finally, Section VI concludes the paper.

#### **II. IMAGING SYSTEM ARCHITECTURE**

The proposed imaging system architecture is presented in Fig. 3, in which the image sensor is composed of 4T pixels



Fig. 3. Imaging system architecture.



Fig. 4. 4T schematic view and layout of the pixel.

structure of an array size  $200 \times 200$ . Since the application requirement is dynamic illumination conditions with minimum power consumption and better integration capability CMOS technology is considered the right choice for the sensor design [14]. CMOS image sensors can be categorized as either operating in current mode or voltage mode depending on the type of readout circuit. In a voltage mode, when the electrons are transformed from charge to voltage domain they are accumulated on the floating diffusion (FD) node and are read out by the source follower (SF) [14]. Whereas in the current mode, either the readout transistor is biased in the velocity saturation region consuming large power because of high V<sub>ds</sub> [15] or linear region where fixed pattern noise (FPN) is critical for linearity [16].

Voltage mode pixels are implemented in 0.18  $\mu$ m 1P6M process technology. Vertical chain of DFFs and the digital logic circuits are used to generate global control signals to control the pixels connected in each row. To read the output values of the signal generated by each pixel is selected by the chain of DFFs and digital logic connected column-wise in the sensor array, the output will be finally buffered and digitalized using an off-chip ADC. Fig. 4 depicts the 4T structure of our pixel, which



Fig. 5. Timing control signals.

comprises a photodiode, transfer switch (TX), reset switch (RS), source follower (SF) switch and a row select (RS) switch. The switches are controlled by the global control signals generated by the row control and column control circuits can be referred to our previous work [17]. Each unit pixel designed with a conventional 4T (four transistors) structure and a photodiode is designed to be around 10  $\mu$ m × 10  $\mu$ m making the total pixel size of about 15  $\mu$ m × 15  $\mu$ m to detect the low light variations each pixel has a fill factor of 60%.

The timing sequence to control the array of pixels and generate an output signal is illustrated in Fig. 5. RS\_IN is an input pulse signal to the vertical DFF chain, RS\_CLK signal controls the respective rows in an array, and the clock frequency is set according to the number of rows in the sensor array. In a respective row cycle, all the pixels within a row are reset with a RST\_GLOBAL signal, and TX\_GLOBAL signal is used to turn on the gate of transfer switch for all the pixels. DFF chain and digital AND logic are used to control the pixels connected in rows. At the bottom of the imaging system horizontal chain of DFFs is used to generate CS\_IN, a column control pulse. CS\_CLK signals are produced to scan all the pixels within a respective row time to read the output signals column-wise. By changing the input signals the corresponding change at the output is observed.

## A. Simulation Results for the Sensor

In this application, the image sensor can be used under dynamic light conditions, specially to the low light situations. To verify and analyze the sensor performance under different illumination conditions, we designed a capacitor and current source combination in a photogeneration block to model a circuit for each photodiode (PD) within a pixel to produce different input pixel signals. The output pixel values are verified with the corresponding change at the input stage. Different pixels array sizes have been implemented to test the design methodology. The output waveforms for  $4 \times 4$  and  $50 \times 50$  sensor array sizes are shown in Fig. 6(a) and (b) respectively, from which we can see that the  $4 \times 4$  and  $50 \times 50$  pixels array generate steps signal at the output, the design shows similar performances with  $100 \times$ 100 array size and  $200 \times 200$  array size. The output voltage from the pixels ranges from 2 mV to 855 mV for the corresponding





Fig. 6. Simulation results from  $4 \times 4$  and  $50 \times 50$  Array.

input values from the photodiode, the input signal variations ranges from 2 pA to 100 pA. The final output shows good linear performance for an input FD voltage of large swing varying from 0.09 V to 1.44 V with a power consumption of 20.8 mW. The linearity of the pixel can be further improved by choosing suitable design parameters, such as a smaller size of the SF and FD.

The image sensor module chip micrograph is shown in Fig. 7, the image sensor measures around 3.5 mm  $\times$  3.5 mm with pad frame, each unit pixel size measures around 15  $\mu$ m  $\times$  15  $\mu$ m big enough to detect low light signal variations [17]. Camera used in [13] is too bulky and it is hard to mount onto the femoral head and integrate with the other modules consuming most of the system power. The sensor size proposed in this paper is more compact and is designed to be integrated with an on-chip image processing ASIC for the SoC development. Table I presents the design summary of our image sensor, in which we can find the pixels resolution for the finished design is 200  $\times$  200 with TSMC 0.18  $\mu$ m process, which can be extended to any array size hence making the whole design flexible for different image processing algorithms.



Fig. 7. Sensor chip micrograph.

TABLE I CMOS IMAGE SENSOR DESIGN SUMMARY

| Process           | TSMC 0.18µm 1-poly 6-metal<br>standard CMOS process<br>technology |  |  |
|-------------------|-------------------------------------------------------------------|--|--|
| Pixels Resolution | 200 x 200                                                         |  |  |
| Pixel Size        | 15μm x 15μm (4T per pixel)                                        |  |  |
| Fill Factor       | 60%                                                               |  |  |
| Chip Size         | 3.5mm x 3.5 mm                                                    |  |  |
| Supply Voltage    | 1.8/3.3                                                           |  |  |



Fig. 8. (a) Pose estimation platform. (b) Blood covered sample image.

## **III. IMAGE PROCESSING FOR PATTERNS DETECTION**

#### A. Method for Patterns Detection

The pose estimation platform is depicted in Fig. 8, in which a camera is used to take the images inside the acetabular cup prosthesis [18], and liner inside the acetabular cup has some reference patterns in square shape and inside each square there is a combination of  $3 \times 3$  small blocks colored white or black representing "1" or "0" respectively. Each pattern has



Fig. 9. Pattern dettion method.

Fig. 10. Pattern detection.

Input Image

its specific 9-bit binary ID and 4 vertexes, which are adopted to estimate the pose of the femoral head inside the acetabular cup. During outside body experiments, the light in the environment is enough and the images of reference patterns have uniform illumination the system works well. However, under dynamic light conditions and blood interference, the visual aided system may fail due to unrecognizable patterns. Therefore, an accurate segmentation method is required to detect and recognize each pattern with all four vertexes and features points to match with system initialized feature points for post-processing during THR surgery. Conventional monochrome image segmentation involve edge detection, histogram thresholding and region-based image segmentation use of these classical methods is limited because of low accuracy, while some other specific theories involve segmentation based on mathematical morphology, statistics and artificial neural network [19] are considered complex.

There are some techniques related to the segmentation of plants from the soil discussed in [20] and the segmentation of optic disc and optic cup in the detection of glaucoma using RGB color space [21]. In our application, we need to detect the patterns and recognize the binary IDs of the customized patterns, so we are proposing a new method to perform both the operations.

A sample image obtained from the pose estimation platform depicted in Fig. 9 is taken as an input image. We have used red ink in place of blood for our experiment to simulate a blood-covered situation. After selecting a red color in the image, an image enhancement technique is applied. In the image, edges for the patterns need to be preserved in order to detect it properly, for noise suppression and preserving the edges we applied a median filter, the generalized equation for the median filter [22] can be expressed as:

$$\widehat{f}(x,y) = \operatorname{median}_{(s,t)\in S_{xy}} \left\{ g(s,t) \right\},\tag{1}$$

where  $\hat{f}(x, y)$  represents a restored image and  $S_{xy}$  represents the set of coordinates. In a square sub-image window,  $3 \times 3$ (s, t) represents the pixel coordinates.

By considering the image sub-window of  $3 \times 3$  square neighborhood the median filter calculates the median value by sorting all the pixel values from the surrounding neighborhood into numerical order and then replaces the pixel value under consideration with the median. Contrast variation, degraded image and uneven light illumination conditions where the pixels cannot

be classified as a foreground or background locally adaptive thresholding is the right option, thresholding is always a vital part in an image binarization [23], T(x, y) represents threshold and it can be expressed as follows:

$$b(x,y) = \begin{cases} 0, & \text{if } I(x,y) \le T(x,y), \\ 1, & \text{otherwise,} \end{cases}$$
(2)

$$T(x,y) = I_o(x,y) + T_o,$$
 (3)

b(x, y) represents a binary image and  $I(x, y) \in [0, 1]$  specifies the intensity of the pixel at any point location (x, y) of the image I. Morphological opening of the image, is denoted by  $I_o(x, y)$  whereas  $T_o$  specifies the threshold of  $I_o$ . Images obtained from our platform have darker foreground patterns than the background for image binarization the features of thresholding including foreground polarity and sensitivity are properly controlled. Patterns are finally obtained by image dilation using square structuring elements. Opening in the morphological image processing is given as  $I \circ b$ , where dilation of the image Iby a structuring element b can be expressed as:

$$I \circ b = (I \ominus b) \oplus b \tag{4}$$

Final output processed images are displayed using blob analysis, and Fig. 10 shows the performance of our proposed method to detect the patterns from the blood-covered situation, All the patterns in the input image can be extracted from the blood, without been affected by uneven light illumination condition. Each pattern with all its feature points and four vertexes can be easily recognized. The performance of a proposed algorithm is compared with the top hat algorithm from our previous work in Fig. 11.

Sample images with different orientations are taken as an input, the proposed method can recognize all the patterns their corresponding vertexes and respective features quite accurately.

The proposed algorithm compared to the top hat algorithm achieves a detection rate as high as 99% with less complexity and more accuracy.

### B. Method for Patterns Recognition

After detecting patterns from the blood, the next step is to recognize the patterns, each pattern represents its specific 9-bit binary ID, and these customized patterns are made up of nine

Detection



Fig. 11. Comparison of Patterns detection. The total number of detected patterns are listed at the end of each row. (a) Input sample images covered in the blood covered situation. (b) Patterns detected using top-hat algorithm [13]. (c) Patterns detection results using the proposed algorithm in this paper.



Fig. 12. Pattern recognition.

small blocks, black represents "0" while the white block represents "1". We have developed a method to recognize the detected patterns, and the algorithm will first find the contours of the patterns quadrangle in shape. These quadrangles are converted into squares of length 100 pixels, creating each cell of size  $20 \times 20$  pixels inside each square. Next step in the algorithm is to find the center of each individual cell within a square if the center of the cell is black it will be detected as "0" and if the center of the cell is white, it will be read as "1".

Fig. 12 shows the output results of the pattern recognition method, it can be seen that all the patterns appeared in the frame can be recognized efficiently. These patterns detection and recognition has provided a major step forward towards pose estimation for the application.

## IV. IMAGE PROCESSING ON A RECONFIGURABLE HARDWARE

Complete embedded vision system on chip is nowadays realizable [24]. We are integrating an image sensor with FPGA to develop a compact vision system on chip for THR surgery. Xilinx



Fig. 13. Architecture for implementation on FPGA.

Virtex 7 board is used to implement image processing algorithms for the ASIC development and to perform parallel processing on the image data pixel by pixel received from the sensor. FPGA processes the image data and transmits the detected feature points of each pattern to the computer. Software developed with Visual Studio calculates the pose estimation based on the processed images. Image data transmission for post-processing using FPGA consumes less power compared to a Wi-Fi module used in our previous works.

For the realization of image processing on reconfigurable hardware, we are using the image data from the acetabular cup acquired by the CMOS image sensor of OV7660. The complete design flow of implementing image processing on FPGA is depicted in Fig. 13, the image size of resolution  $286 \times 186$  from our smart trial prototype is chosen as an input for processing. Since the images cannot be read by FPGA in bitmap format the image is required to be converted into hexadecimal format. Pre-processing of converting the bitmap into hexadecimal is done in Matlab.

The hexadecimal format includes RGB data of the image, which is loaded onto the FPGA using RS 232 cable. The universal asynchronous receiver-transmitter (UART) port is used to establish a serial communication link between the computer and the FPGA. The RGB input data is saved onto on-chip BRAM at the address locations generated by the address generation unit (AGU). The RGB data is read out from block RAM and sent to the 24 bits shift register for RGB combination for further processing. To speed up the processing time, two pixels are processed and transmitted in parallel. The processing block performs three operations (1) brightness control for uneven illumination conditions in the input images, (2) segmentation of patterns from red color is realized to avoid interference of blood in detecting the patterns, and (3) threshold operation is used for isolating the patterns of interest.

In the next step, the processed RGB image data is written onto three separate BRAMs. AGU sends a valid signal to the finite state machine block when the image writing process stops. The finite state machine generates a valid signal for the multiplexer and AGU to read the processed data from the BRAM. The processed image data is sent back to the computer for postprocessing using UAT.

#### V. EXPERIMENTAL RESULTS AND DISCUSSIONS

Xilinx 1SE 14.4 design suit and ModelSim 10.0 SE is adopted to implement the image processing algorithms for functional design verification and simulations. The processing algorithms are synthesized on Xilinx Virtex 7 of part number xc7vx485t-2ffg1761 FPGA board for its real-time implementation. For simulation purposes, red ink is used in place of blood and poured inside the acetabular cup, which is covered with the plastic wrap, and the images are obtained from the platform exemplified in Fig. 8. The input image of size  $286 \times 186$  is converted into hexadecimal format because the FPGA cannot handle standard image formats, the hexadecimal image data is then sent through the serial communication protocol using Universal Asynchronous Receiver (UAR) into the block RAM, the image data is readout of the block RAM into 24-bit shift register for RGB combination. In the application, uneven light conditions, blood interference and features extractions are the key problems to address. We implemented three operations including brightness control, image inversion and thresholding to extract the features of the patterns effectively.

By adding a constant value to each pixel value can increase the brightness of low light intensity, whereas by subtracting the constant value from the pixel value will decrease the brightness of the image. Segmentation of patterns from the red color to see the details of each pattern in more detail, an inversion operation is applied to the input images. The output results can be seen in Fig. 14, where each pattern is segmented from the red color. This operation is useful in the application because of uneven light illumination can produce a shadowed or saturated regions [25] in the images causing it difficult to identify the patterns in the blood-covered situation.

The output from this operation can be used for pattern detection for further processing, required in the pose estimation. The inversion operation reverses the order of pixel values when multiplied by -1 and by adding the constant value maps the output result in the permissible range. Consider a pixel value n = I(x, y) within the range [0,  $n_{\text{max}}$ ]; the inversion operation can be expressed as equation (5)

$$f_{inv}(n) = -n + n_{\max} = n_{\max} - n$$
 (5)

as an input is a color image, the RGB component of each pixel is equalized by taking the average of these color components [26]. Threshold operation is also significant for our application. In the

Fig. 14. Experiment results by implementing the image processing algorithms. (a) Output image after implementing brightness control operation. (b) Output image after segmentation of patterns from the red color by implementing image inversion operation. (c) Output image after the implementation of threshold control operation.

segmentation process, thresholding helps to isolate the patterns from the background. The operation transforms all the pixels in two values, in this type of quantization the pixel values are compared with the constant threshold value  $a_{th}$ , which allows separating the pixels into two different classes. The function can be expressed as equation (6) [27], the function  $f_{th}(a)$  will map the pixels values to either  $a_0$  or  $a_1$  that is the

$$f_{th}(a) = \begin{cases} a_0, & \text{for } a < a_{th} \\ a_1, & \text{for } a \ge a_{th} \end{cases}$$
(6)

where  $0 < a_{th} < a_{max}$ .

The processed image data is separated into its RGB components and sent into the three separate memory locations respectively. Once the process is complete the bitmap image data of the processed image is written onto the memory locations, the next step is to write the header for the image without the header image cannot be displayed at the output properly. The experiment setup is illustrated in Fig. 15, in which Xilinx FPGA is adopted in order to verify our processing on real-time hardware. The input image of size  $286 \times 186$  with the patterns covered in the blood after three operations including the brightness, image inversion and the threshold the patterns can be analyzed quite efficiently in FPGA. Behavioral model of the algorithm is analyzed in ModelSim 10.0 SE and Xilinx ISE 14.4 is used for the synthesis and implementation. The implementation results are summarized in Table II. We can see that the synthesis frequency is 304 MHz using the LUTs and slice registers 760 and 819 respectively with a baud rate of 9600. Overall, power consumption in processing is around 213 mW, which considerably reduces power consumption by the whole system. The image data from the FPGA using serial communication link Universal Asynchronous Transmitter (UAT) is sent back to the computer for post processing.





Fig. 15. Experiment setup.

TABLE II IMPLEMENTATION SUMMARY FROM VIRTEX 7 FPGA BOARD

| Design                            | LUTs                                            | Slice<br>Registers                                  | Frequency                                            | BRAMs                              | Power<br>(mW)               |
|-----------------------------------|-------------------------------------------------|-----------------------------------------------------|------------------------------------------------------|------------------------------------|-----------------------------|
| Serial<br>Test                    | 760                                             | 819                                                 | ~304 MHz                                             | 167                                | 213                         |
| CMOS<br>Sen<br>200 x<br>Honge Sen | Image<br>sor<br>200/<br>off chip<br>sing Module | Image<br>data<br>CLI<br>Camera<br>Link<br>Interface | Image Processir<br>I/P Pre<br>CLI Processing<br>FPGA | Ig<br>Extraction<br>of<br>features | Image<br>Post<br>Processing |

Fig. 16. Embedded vision system design flow.

An embedded vision system architecture to be used for the application is depicted in Fig. 16. The image data from our application-specific image sensor will be sent to the FPGA using a camera link interface. Where the FPGA will provide an on-chip image processing for the image data provided by the sensor.

Power consumption based on monocular vision (customized patterns) and IMU-based system are measured to be around 980 mW and 768 mW respectively, which is relativity high and consumes most of the system power. Implementation of algorithms proposed in this paper for on-chip processing and transmitting the images consumes only 213 mW that is considerably lower than the power consumptions presented in our previous work. Table III shows performance comparisons of the method proposed in our previous works. Parallel processing of the image data on-chip reduces the power consumed by the

TABLE III PERFORMANCE OF IMPLEMENTATION METHODS

| Method                                         | Processing<br>Platform | Accuracy<br>(Patterns<br>Detection) | Power |
|------------------------------------------------|------------------------|-------------------------------------|-------|
| Monocular vision<br>(customized patterns)      | CPU                    | 94.1%                               | 980mW |
| IMU and monocular vision (chessboard patterns) | CPU                    | -                                   | 768mW |
| This Work (customized patterns)                | Xilinx<br>Vertex-7     | 99%                                 | 213mW |

\*The performance of monocular vision and IMU based methods are reported from our from previous work [13].

Wi-Fi transceiver. The feature points and coordinates of each detected pattern are transmitted directly to the computer using a serial communication link where software developed in Visual Studio estimates the relative pose between the femoral head and acetabular cup prosthesis from the extracted feature points after processing and the system initialized feature points.

So far, our image sensor with its custom resolution, size, and low power is considered enough for the application. However, in our future work, we will consider speed and accuracy by dividing the array of  $200 \times 200$  pixels into  $4 \times 4$  pixels block, to introduce signal processing within a single block as well as neighboring blocks of pixels for fast computation. This method will distribute the processing workload of our entire image sensor. ASIC design for image processing will be realized in the near future for the optimization of power consumption. As a result, the SoC design together with the CMOS image sensor for the application will be implemented.

#### VI. CONCLUSION

In this paper, we first present the design and simulation of CMOS image sensor for our smart trial. The proposed image sensor replaces a bulky camera from the femoral head, which consumes most of the system power. In future, we will consider speed and accuracy of the sensor by dividing the array of  $200 \times$ 200 pixels into  $4 \times 4$  pixels block, for fast calculation within the block. Secondly, we propose a new method to detect each pattern from the blood-covered situation and then recognize each pattern from its features by generating its corresponding binary ID. The pattern detection rate is as high as 99%, which is 5% better in accuracy than the top hat algorithm. Furthermore, in order to reduce power consumption by the Wi-Fi module to transmit data wirelessly to the computer outside for pose estimation calculation. Image processing algorithms are implemented on FPGA to perform parallel processing on the image data pixel by pixel received from the sensor. The extracted feature points and coordinates are directly sent to the computer using a serial communication link without using a Wi-Fi transceiver, which reduced the overall system power. The design is synthesized using Xilinx Virtex 7 FPGA board for the ASIC development. Power consumption by the system is 213 mW, which is a 70% decrease compared to 980 mW and 768 mW consumed by monocular and IMU-based systems respectively. The reduction

of power consumption in this work significantly improves the operation time of our system. This digital design implementation will help for our final SoC design by integrating the CMOS image sensor with on-chip image processing in the future.

#### REFERENCES

- B. J. Thomas and J. B. Stiehl, "Basic of total hip replacement surgery," in *Navigation and Robotics in Total Joint and Spine Surgery*, J. B. Stiehl, W. H. Konermann, and R. G. Haaker, Eds. Berlin, Germany: Springer, 2004.
- [2] K. J. Bozic, "Fifth AJRR annual report on hip and knee arthroplasty data," Amer. Joint Replacement Registry, Rosemont, IL, USA, Annu. Rep, 2018.
- [3] J. Nutt, K. Papanikolaou, and C. Kellett, "Complications of total hip arthroplasty," *Orthopaedics Trauma*, vol. 27, no. 5, pp. 272–276, 2013.
- [4] A. Blom, M. Rogers, A. H. Taylor, G. Pattison, S. Whitehouse, and G. C. Bannister, "Dislocation following total hip replacement: The Avon orthopaedic centre experience," *Ann. Royal College Surgeons England*, vol. 90, no. 8, pp. 658–662, 2008.
- [5] A. Bunn, C. Colwell, and D. D'Lima, "Bony impingement limits designrelated increases in hip range of motion," *Clin. Orthopaedics Related Res.*, vol. 470, no. 2, pp. 418–427, 2011.
- [6] K. H. Widmer, "Is there really a 'safe zone' for the placement of total hip components?," *Ceram. Orthopaedics*, vol. 75, no. 5, pp. 249–252, 2006.
- [7] H. Miki, W. Yamanashi, T. Nishii, Y. Sato, H. Yoshikawa, and N. Sugano, "Anatomic hip range of motion after implantation during total hip arthroplasty as measured by a navigation system," *J. Arthroplasty*, vol. 22, no. 7, pp. 946–952, 2007.
- [8] S. Parratte, "Validation and usefulness of a computer-assisted cuppositioning system in total hip arthroplasty," J. Bone Joint Surgery Amer., vol. 89, no. 3, pp. 494–499, 2007.
- [9] A. Martin and A. von Strempel, "CT-based and CT-free navigation in total knee arthroplasty–A prospective comparative study with respects to clinical and radiological results," *Zeitschrift fur Orthopadie und ihre Grenzgebiete*, vol. 143, no. 3, pp. 323–328, 2004.
- [10] H. Kiefer and A. Othman, "OrthoPilot total hip arthroplasty workflow and surgery," *Orthopedics*, vol. 28, no. 28, pp. 1221–1226, 2005.
- [11] R. McGinnis *et al.*, "Accuracy of femur angles estimated by IMUs during clinical procedures used to diagnose femoroacetabular impingement," *IEEE Trans. Biomed. Eng.*, vol. 62, no. 6, pp. 1503–1513, Jun. 2015.
- [12] H. Chen, J. Gao, S. Su, X. Zhang, and Z. Wang, "A visual-aided wireless monitoring system design for total hip replacement surgery," *IEEE Trans. Biomed. Circuits Syst.*, vol. 9, no. 2, pp. 227–236, Apr. 2015.
- [13] S. Su, Y. Zhou, Z. Wang, and H. Chen, "Monocular vision- and IMUbased system for prosthesis pose estimation during total hip replacement surgery," *IEEE Trans. Biomed. Circuits Syst.*, Jun. 2017, no. 3, vol. 11, pp. 661–670.
- [14] F. Wang and A. J. P. Theuwissen, "Linearity analysis of a CMOS image sensor," in *Proc. Electron. Imag.*, 2017, pp. 84–90.
- [15] Z. Yang, V. Gruev, and J. Van der Spiegel, "Low fixed pattern noise currentmode imager using velocity saturated readout transistors," in *Proc. IEEE Int. Symp. Circuits Syst.*, 2007, pp. 2842–2845.
- [16] X. Wu, M. Zhang, and J. Van der Spiegel, "High linearity current mode image sensor," in *Proc. IEEE Int. Conf. Electron Devices Solid State Circuit*, 2012, pp. 1–4..
- [17] S. Mudassir, X. Cao, S. Su, H. Chen, and Z. Wang, "Design and simulation of an application specific CMOS image sensor," in *Proc. IEEE Int. Conf. Electron. Commun. Eng.*, 2018, pp. 110–114.
- [18] S. Mudassir, S. Su, M. Long, and Z. Wang, "Patterns detection and recognition in visual aided system for prosthesis pose estimation during total hip replacement surgery," in *Proc. IEEE Asia Pacific Conf. Circuits Syst.*, 2018, pp. 556–559.
- [19] M. Sonka, V. Hlavac, and R. Boyle, *Image Processing, Analysis and Machine Vision*, 4th ed. Andover, U.K.: Cengage Learning, 2014.
- [20] J. Arribas *et al.*, "Leaf classification in sunflower crops by computer vision and neural networks," *Comput. Electron. Agriculture.*, vol. 78, no. 1, pp. 9–18, 2011.
- [21] N. Noor, N. Khalid, and N. Ariff, "Optic cup and disc color channel multi-thresholding segmentation," in *Proc. IEEE Int. Conf. Control Syst.*, *Comput. Eng.*, 2013, pp. 530–534.
- [22] R. Gonzalez, R. Woods, and S. Eddins, *Digital Image Processing Using MATLAB*. Upper Saddle River, N.J., USA: Pearson Prentice Hall, 2004.

- [23] T. Romen, S. Roy, O. Singh, T. Sinam, and K. Singh, "A new local adaptive thresholding technique in binarization," *Int. J. Comput. Sci. Issues*, vol. 8, no. 6, pp. 271–277, Nov. 2011.
- [24] F. Paillet and D. Mercier, "Design solutions and techniques for vision system on a chip and fine-grain parallelism circuit integration," in *Proc. 13th Annu. IEEE Int. ASIC/SOC Conf.*, Arlington, VA, USA, Sep. 13–16, 2000.
- [25] John C. Russ, *Image Processing Handbook*, 6th ed. Boca Raton, FL, USA: CRC Press, pp. 270–331, 2011.
- [26] W. Burger and M. Burge, Principles of Digital Image Processing Fundamental Techniques (Undergraduate Topics in Computer Science), London, U.K.: Springer-Verlag London Limited, 2009, DOI: 10.1007/978-1-84800-191-6\_4.
- [27] I. Chiuchisan, "A new FPGA-based real-time configurable system for medical image processing," in *Proc. E-Health Bioengineering Conf.*, 2013, pp. 1–4.



Syed Mudassir Hussain received the B.S. and M.S. degrees in electronic engineering from the Balochistan University of Information Technology Engineering and Management Sciences (BUITEMS), Quetta, Pakistan, in 2010 and 2015, respectively. He is currently working toward the Ph.D. degree with the Department of Microelectronics and Nanoelectronics, Tsinghua University, Beijing, China. Since 2011, he has been working with the Department of Electronic Engineering, BUITEMS, where he is currently an Assistant Professor. His research interests include

integrated circuit designs and digital image processing. His ongoing research is about SoC design for a visual aided system for total hip arthroplasty surgeries.



Fasih Ud Din Farrukh received the B.Sc. degree in electronic engineering from the University College of Engineering and Technology, Islamia University Bahawalpur, Pakistan, in 2004, and the M.S. degree in electronic engineering from the Ghulam Ishaq Khan Institute of Engineering Sciences & Technology, Pakistan, in 2007. He is currently working toward the Ph.D. degree in electronic science and technology at the Institute of Microelectronics (IME), Tsinghua University, Beijing, China. Before joining, he was a Senior Design Engineer in a leading R&D organi-

zation in Pakistan. His current research mainly focuses on AI-based system designs. His ongoing research is about the integrated circuit designs for AI-based system applications and SoC design for medical applications.



Shaojie Su received the B.S. degree from the School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, China, in 2012, and the Ph.D. degree from Tsinghua University, Beijing, China. His current research interests include biomedical applications. His ongoing work is about a visual aided system for total hip replacement surgeries.



Zhihua Wang (M'99–SM'04–F'17) received the B.S., M.S., and Ph.D. degrees in electronic engineering from Tsinghua University, Beijing, China, in 1983, 1985, and 1990, respectively. He has worked as a Full Professor and Deputy Director of the Institute of Microelectronics, Tsinghua University since 1997 and 2000, respectively. He was a Visiting Scholar with Carnegie Mellon University (CMU) from 1992 to 1993 and KU Leuven, Belgium, from 1993 to 1994, and was a Visiting Professor with Hong Kong University of Science and Technology (HKUST) from

September 2014 to March 2015. He has coauthored 11 books/chapters, over 146 (396) papers in international journals (conferences), and holds 97 Chinese and five US patents. His research interests include CMOS RFIC and biomedical applications, involving RFID, PLL, low-power wireless transceivers, and smart clinic equipment combined with leading-edge RFIC and digital image processing techniques.

Prof. Wang has served as the Chairman of the IEEE SSCS Beijing Chapter from 1999 to 2009, an AdCom Member of the IEEE SSCS from 2016 to 2019, a Technology Program Committee Member of the IEEE ISSCC from 2005 to 2011, a Steering Committee Member of the IEEE A-SSCC (2005-), the Technical Program Chair for A-SSCC 2013, a Guest Editor for IEEE JSSC Special Issues, an Associate Editor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I, II and IEEE TRANSACTION ON BIOMEDICAL CIRCUITS AND SYSTEMS, and other administrative/expert committee positions in China's national science and technology projects.



Hong Chen (S'03–M'07–SM'17) received the Ph.D. degree from Tsinghua University, in 2005. From 2005 to 2007, she worked with the Institute of Microelectronics in Tsinghua University (IMETU) as a Post-doctoral Fellow, where currently she is working as an Assistant Professor since 2007. In 2006 and 2016, she worked with the Medical Center, Nebraska University, Lincoln, NE, USA, and with the Department of Electronics and Computer Engineering, Georgia Tech, Atlanta, GA, USA, respectively, as a Visiting Scholar. She has authored or coauthored 75 journal

and conference papers and two books. She also holds 18 granted Chinese and American patents. Her research interests include monitoring-system design for TKR/THR surgery, low-power digital integrated-circuit design, asynchronous circuit design, PZT power electronics, and low-power mixed-signal SoC design. Dr. Chen has been a Technologies Program Committee Member of ASYNC since 2014. She received the Best Demo Award at ISCAS 2013.