1. Introduction
With the great advances of Deep Convolutional Neural Network (DCNN) techniques [26,31,36,37,41,62,67] and large-scale datasets [13,58] in computer vision and image understanding, DCNN-based techniques have become an extensive research topic in Face Recognition (FR) and have been widely used in many real-world applications. Consequently, more and more novel and efficient FR DCNNs [9,55,60,63,64,65,66,68] and margin-based loss functions [14,35,43,73] have been designed, replacing traditional methods [4,10,16,27,38,42,53,71,77,83] and becoming the mainstream approaches to FR. However, the large number of floating-point operations (FLOPs), large parameter counts, and huge model sizes lead to high computational complexity, making it difficult to deploy DCNN models on Internet of Things (IoT) or mobile devices with limited memory in practical applications [9,60,68], such as video surveillance, law enforcement, access control, marketing, smartphones, embedded systems, and wearable devices. Moreover, large-scale face datasets with pose changes, illumination changes, low resolution, and motion blur also pose a challenge to recognition accuracy [15,49]. To solve these problems, some scholars are therefore working on efficient and effective lightweight models with fewer parameters, lower FLOPs, and smaller model sizes.
To obtain lightweight FR models, many efforts have been devoted to keeping an optimal trade-off between accuracy and efficiency. It is therefore necessary to review recent lightweight FR models to inspire the development and application of lightweight FR. There have been several surveys [1,2,17,34,49,50,72,87] on FR that review almost all FR DCNN models. However, they do not cover recently published lightweight FR models; some of them focus on specific tasks; and few surveys reimplement the reviewed lightweight models. To the best of our knowledge, only one article [49] has provided a relevant survey of lightweight FR models, offering a benchmark of lightweight FR. In summary, end-to-end lightweight FR models need to be systematically reviewed from a variety of perspectives, yet few existing surveys attach importance to this task. Our survey therefore focuses on reviewing the lightweight FR models most needed in practical applications. Different from [49], this article classifies the existing state-of-the-art end-to-end lightweight models into five categories and reimplements several mainstream models. The main contributions can be summarized as follows:
Firstly, we categorize the lightweight FR models into five categories to inspire the exploration of new lightweight FR models.
Secondly, we introduce SqueezeFaceNet and EfficientFaceNet by pruning SqueezeNet and EfficientNet, and reimplement several of the most popular lightweight FR models. Comprehensive performance comparisons are also presented.
Thirdly, we present some of the challenges and future trends to inspire our future work.
The remainder of the paper is organized as follows. Section 2 briefly reviews FR tasks and lightweight models, and describes the categories of existing state-of-the-art end-to-end lightweight models. As an example, Section 3 introduces SqueezeFaceNet and EfficientFaceNet, obtained by pruning SqueezeNet and EfficientNet. In Section 4, we reimplement several mainstream models and present a detailed performance comparison of different lightweight models on nine test benchmarks. Section 5 presents some challenges and future work, and Section 6 concludes this work.
2. Face Recognition and Lightweight Model
2.1 Face Recognition
An automatic face recognition system, which consists of four main components: face detection, face alignment, feature extraction, and face matching, aims at implementing two different tasks, namely one-to-one (1:1) Face Verification and one-to-many (1:N) Face Identification [17]. Face verification judges whether two face images belong to the same identity, without needing to know the identity of either image. It is essentially a binary classification problem and is usually used in scenarios such as witness comparison and identity verification. Face identification determines the identity of a probe face descriptor against a registered face gallery. It is essentially a multi-class classification problem, with common application scenarios including access control systems, venue sign-in systems, and so on.
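Both tasks reduce to comparing similarities between face embeddings. A minimal sketch, assuming cosine similarity on learned embedding vectors (the threshold value is purely illustrative and is normally tuned on a validation set):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two face embedding vectors."""
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b)
    return float(np.dot(a, b))

def verify(a: np.ndarray, b: np.ndarray, threshold: float = 0.5) -> bool:
    """1:1 verification: same identity iff similarity exceeds the threshold."""
    return cosine_similarity(a, b) >= threshold

def identify(probe: np.ndarray, gallery: list) -> tuple:
    """1:N identification: return the gallery index with the highest similarity."""
    scores = [cosine_similarity(probe, g) for g in gallery]
    best = int(np.argmax(scores))
    return best, scores[best]
```

Verification thus needs only a pairwise score and a threshold, while identification ranks the probe against every registered gallery descriptor.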
In view of the broad application prospects of FR, a series of traditional FR methods sprang up from the early 1990s, such as Eigenfaces [71], Fisherfaces [4], Bayesian eigenfaces [53], Laplacianfaces [27], Sparse Representation [16,77,83], Gabor features [42], and learning-based descriptors [10,38]. However, these methods either fail to address uncontrolled facial changes, lack distinctiveness and compactness, or produce only shallow representations [72]. To address these problems, building on LeNet [37], which was applied to handwritten digit recognition in 1990, and on AlexNet [37], NiN [41], VGG [62], GoogleNet [67], ResNet [26], and DenseNet [31], which won ImageNet competitions, many FR DCNN models have achieved remarkable progress, such as FaceNet [60], DeepFace [68], the DeepID series [63,64,65,66], VGGFace [55], and VGGFace2 [9]. In particular, various novel margin-based loss functions, such as ArcFace [14], SphereFace [43], CosFace [73], and AdaFace [35], have also greatly promoted recognition performance.
A standard pipeline of an automatic FR system is shown in <Figure 1>. When the system receives still images or video frames as input, face detection locates the face regions; then, the located face is calibrated and its pixels are resized and normalized in the face alignment stage; next, the feature extraction stage extracts discriminative features using DCNNs; finally, the face verification or face identification task is conducted in the face matching stage.
However, it is difficult to deploy these FR DCNNs on IoT or mobile devices with limited memory due to their large parameter counts, FLOPs, and model sizes. Thus, achieving an optimal trade-off between accuracy and efficiency is becoming more and more important.
2.2 Lightweight Model
Considering the huge demand for deploying FR models on IoT or mobile devices with limited memory, lightweight FR models have been researched in recent years to keep an optimal trade-off between performance and efficiency. These lightweight FR models also follow the standard pipeline of an automatic FR system shown in <Figure 1>. Nowadays, a variety of lightweight architectures achieve state-of-the-art (SOTA) performance, and they can be categorized into: (1) artificially designed lightweight FR models (ADLM), (2) pruned models for face recognition (PM), (3) efficient automatic neural network architecture design based on neural architecture search (ANND_NAS), (4) knowledge distillation (KD), and (5) low-rank decomposition (LRD). <Table 1> lists the categorization of lightweight FR models. We mainly present the name or author of each model, input size, MFLOPs, number of parameters, model size, accuracy on LFW [32], and publication year to facilitate comparison.
For the first class, ADLM means that researchers artificially design an efficient lightweight FR model that keeps an optimal trade-off between performance and efficiency. Wu et al. developed the LightCNN [78] family of architectures, namely LightCNN-4, LightCNN-9, and LightCNN-29, to learn a robust face representation on a noisily labeled dataset. LightCNN-4, LightCNN-9, and LightCNN-29 contain 4.095M, 5.556M, and 12.637M parameters, with about 1500, 1000, and 3900 MFLOPs, respectively. ConvFaceNeXt [29], designed by Hoo et al., stacks stem, bottleneck, and embedding partitions to construct the ConvFaceNeXt family; the largest model contains 1.05M parameters and 410.59 MFLOPs.
For the second class, PM means that researchers tailor FR models by pruning commonly used SOTA lightweight networks, including the MobileNet series [30,59], ShuffleNetV2 [46], MixConv [69], VarGNet [85], and so on, to construct MobileFaceNetV1 [49], MobileFaceNets [11], ShuffleFaceNet [48], MixFaceNet [6], VarGFaceNet [80], and so on. MobileFaceNetV1 [49] and MobileFaceNets [11] are designed based on MobileNetV2 [59]. MobileFaceNets adopts a typical inverted (reverse) residual block and needs to train about 1.03M parameters with 473.15 MFLOPs. A family of ShuffleFaceNet [48] models is built on ShuffleNetV2; the smallest, ShuffleFaceNet0.5×, has about 0.5M parameters and 66.9 MFLOPs but does not maintain competitive accuracy. Therefore, ShuffleFaceNet1×, with about 1.4M parameters and 275.8 MFLOPs, is commonly used.
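The efficiency of the inverted residual design can be seen from a simple parameter count. The sketch below compares one block (1×1 expansion, 3×3 depthwise convolution, 1×1 projection; biases folded into batch norm and ignored) with a plain 3×3 convolution. The channel and expansion numbers are illustrative, not the exact MobileFaceNets configuration:

```python
def inverted_residual_params(c_in: int, c_out: int,
                             expansion: int = 2, kernel: int = 3) -> int:
    """Parameter count of one inverted residual block:
    1x1 expand -> 3x3 depthwise -> 1x1 project."""
    c_mid = c_in * expansion
    expand = c_in * c_mid          # 1x1 pointwise expansion
    depthwise = c_mid * kernel**2  # depthwise conv: one kernel per channel
    project = c_mid * c_out        # 1x1 pointwise projection
    return expand + depthwise + project

def standard_conv_params(c_in: int, c_out: int, kernel: int = 3) -> int:
    """Parameter count of a plain 3x3 convolution, for comparison."""
    return c_in * c_out * kernel**2
```

For 64 input and output channels with expansion 2, the block needs 17,536 parameters versus 36,864 for a plain 3×3 convolution, which is why stacks of such blocks stay around the 1M-parameter scale.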
For the third class, ANND_NAS means directly learning neural network architectures for FR via neural architecture search (NAS) [90]. The article [49] introduces a modified version of ProxylessFaceNAS based on ProxylessNAS [8], with 3.01M parameters and about 873.95 MFLOPs. Boutros et al. proposed the PocketNet family based on NAS and knowledge distillation [28], which contains 0.925M parameters and 587.11 MFLOPs.
For the fourth class, knowledge distillation (KD) [57,82], which trains a small student network under the supervision of a large teacher network, aims to obtain a compact neural network that reproduces the outputs of the large one. The KD-based EC-KD [74] and ShrinkTeaNet [18] can be found in <Table 1>.
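The core of KD can be sketched as matching temperature-softened teacher and student distributions with a KL-divergence loss. This is a minimal NumPy sketch; the temperature value is illustrative, and FR-specific variants such as EC-KD and ShrinkTeaNet add further terms on embeddings:

```python
import numpy as np

def softmax(z: np.ndarray, T: float = 1.0) -> np.ndarray:
    """Temperature-scaled softmax (numerically stabilized)."""
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits: np.ndarray,
                      teacher_logits: np.ndarray, T: float = 4.0) -> float:
    """KL divergence between softened teacher and student distributions;
    the T**2 factor keeps gradient magnitudes comparable across temperatures."""
    p = softmax(teacher_logits, T)  # soft targets from the large teacher
    q = softmax(student_logits, T)
    return float((p * (np.log(p) - np.log(q))).sum(axis=-1).mean() * T**2)
```

The loss is zero when the student exactly matches the teacher and grows as the two distributions diverge; in practice it is combined with the ordinary classification loss on hard labels.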
For the fifth class, low-rank decomposition (LRD) refers to using low-rank matrices to approximate the weight matrices of DCNNs, yielding compressed lightweight DCNNs. SILR [81] and LRRNet [86] can be found in <Table 1>.
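The idea can be sketched with a truncated SVD: a weight matrix W of size m×n is replaced by two factors holding r(m+n) parameters instead of mn. This is a minimal sketch; actual methods such as SILR choose ranks per layer and fine-tune the network afterwards:

```python
import numpy as np

def low_rank_factorize(W: np.ndarray, rank: int):
    """Approximate a weight matrix W (m x n) by factors A (m x r) and B (r x n),
    replacing m*n parameters with r*(m + n)."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]   # absorb singular values into the left factor
    B = Vt[:rank, :]
    return A, B
```

In a network, the original layer W x is then replaced by two thinner layers A (B x), which is exact when W truly has rank r and an approximation otherwise.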
These SOTA models aim to improve compactness and computational efficiency, overcoming the deployment difficulties caused by massive trainable parameters. Meanwhile, as shown in <Table 1>, we also simply prune SqueezeNet [33] and EfficientNet [70] to construct SqueezeFaceNet and EfficientFaceNet; the details are introduced in Section 3.
3. SqueezeFaceNet and EfficientFaceNet
As a simple supplement to the PM category, this section introduces SqueezeFaceNet and EfficientFaceNet, based on SqueezeNet [33] and EfficientNet [70].
3.1 SqueezeFaceNet
SqueezeFaceNet is simply pruned from SqueezeNet [33] and consists of fire modules and a Global Depthwise Convolution (GDC) [11] layer. The fire modules [33] extract useful features, and the GDC treats different spatial units with different importance [11]. Finally, we adopt the margin-based ArcFace [14] loss function. <Table 2> shows the architecture of SqueezeFaceNet.
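The GDC layer can be sketched as a depthwise convolution whose kernel spans the entire final feature map, so each spatial position receives its own learned weight rather than the uniform 1/(H·W) weighting of global average pooling. A minimal NumPy sketch with illustrative shapes:

```python
import numpy as np

def global_depthwise_conv(feat: np.ndarray, weight: np.ndarray) -> np.ndarray:
    """Global Depthwise Convolution over a (C, H, W) feature map: the kernel
    covers the whole spatial extent, producing one output value per channel."""
    assert feat.shape == weight.shape
    return (feat * weight).sum(axis=(1, 2))  # -> (C,) embedding vector
```

If the learned weights happen to be uniform, GDC reduces exactly to global average pooling; learning them instead lets the network emphasize, e.g., the central face region over the borders.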
3.2 EfficientFaceNet
EfficientFaceNet is simply pruned from EfficientNet [70] and consists of MBConv1 and MBConv6 modules and a GDC [11] layer. The MBConv modules [70] extract useful features, and the GDC treats different spatial units with different importance [11]. Finally, we adopt the margin-based ArcFace [14] loss function. <Table 3> shows the architecture of EfficientFaceNet.
4. Performance Comparison
This section comprehensively presents the performance of the most common models listed in <Table 1>. We reimplement EfficientFaceNet, SqueezeFaceNet, MixFaceNet, ShuffleFaceNet0.5×, MobileFaceNet, and LightCNN-9 to make a fair comparison by using the same computing resources, training dataset, and hyper-parameters. All code follows the original papers and ArcFace [14].
4.1 Experimental Settings
Train Dataset. We use the MS1MV3 [15] dataset, containing approximately 93K identities and 5.2M images, as the training dataset in our experiments. It is semi-automatically cleaned from MS1MV0 (about 10M images of 100K identities) [14] and is also an enhanced version of MS1MV2 (about 5.8M images of 85K identities) [25].
Test Datasets. To systematically compare the listed lightweight methods, nine challenging test datasets are used; the details are listed in <Table 4>. In the test stage, 1:N Face Identification is conducted on IJB-B and IJB-C, while 1:1 Face Verification is performed on all datasets, including IJB-B and IJB-C.
Implementation Details. In our experiments, all training details follow ArcFace [14]. The batch size is 128, and the optimizer is SGD with a learning rate of 0.1, a momentum of 0.9, and a weight decay of 1e-4. Training runs for 40 epochs, and the scale parameter s and margin value of the ArcFace loss are set to 64 and 0.5, respectively. We use a Linux machine (Ubuntu 18.04.1 LTS) with an Intel(R) Core(TM) i9-9900KS CPU @ 4, 32 GB RAM, and 3 Nvidia GeForce RTX 2060 (6 GB) GPUs for all experiments. The experiments are implemented in PyTorch v1.11.0 [56], and mixed-precision training [52] is employed to save GPU memory and accelerate training.
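With these settings, the ArcFace logits used during training can be sketched as follows. This is a minimal NumPy forward pass for illustration; the real implementation operates on GPU tensors and feeds the resulting logits to softmax cross-entropy:

```python
import numpy as np

def arcface_logits(embeddings: np.ndarray, weights: np.ndarray,
                   labels: np.ndarray, s: float = 64.0, m: float = 0.5):
    """ArcFace: add an angular margin m to the target-class angle, then scale
    by s. Embeddings (B, d) and class weights (C, d) are L2-normalized so
    their dot product equals cos(theta)."""
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    w = weights / np.linalg.norm(weights, axis=1, keepdims=True)
    cos = np.clip(e @ w.T, -1.0, 1.0)
    theta = np.arccos(cos)
    rows = np.arange(len(labels))
    theta[rows, labels] += m       # margin penalty on the ground-truth class
    return s * np.cos(theta)
```

The margin lowers the ground-truth logit relative to the plain cosine, forcing the network to push genuine pairs closer together on the hypersphere than the decision boundary alone would require.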
4.2 Evaluation and Comparison
The 1:N Face Identification and 1:1 Face Verification tasks are conducted in this section. All experiments compare not only the number of parameters, model size, and FLOPs of different models but also their recognition accuracy. Lower parameter counts, model sizes, and MFLOPs are better, whereas higher accuracies are better. <Table 5>, <Table 6>, and <Table 7> are each divided into two parts by a dotted line: the data above the dotted line are taken from the related papers, and the data below it come from our reimplementation experiments.
1:1 Face Verification Evaluation Results.
A series of 1:1 face verification experiments is conducted on the LFW, CA-LFW, AgeDB-30, CP-LFW, CFP-FP, CFP-FF, and VGG2-FP datasets, and performance is reported as the accuracy of 10-fold cross-validation. The referenced and reimplemented results are all listed in <Table 5>.
To better investigate the performance of lightweight models on the two IJB datasets, which combine high- and low-quality images and video frames, a series of 1:1 face verification experiments is conducted on both IJB datasets. TAR@FAR is reported as the face verification metric (higher is better). The referenced and reimplemented results are all listed in <Table 6>.
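The TAR@FAR metric can be computed by setting an acceptance threshold from the impostor (different-identity) score distribution and then measuring the genuine acceptance rate at that threshold. A minimal sketch, assuming higher scores mean more similar pairs:

```python
import numpy as np

def tar_at_far(genuine_scores, impostor_scores, far: float = 1e-4) -> float:
    """TAR@FAR: pick the threshold at which the fraction of accepted impostor
    pairs equals `far`, then report the fraction of accepted genuine pairs."""
    impostor = np.sort(np.asarray(impostor_scores))[::-1]
    k = max(int(far * len(impostor)), 1)
    threshold = impostor[k - 1]               # k-th highest impostor score
    return float(np.mean(np.asarray(genuine_scores) >= threshold))
```

Reporting TAR at a fixed, very low FAR (e.g. 1e-4) reflects deployment settings where falsely accepting a stranger is far more costly than rejecting a genuine user.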
1:N Face Identification Evaluation Results.
A series of 1:N face identification experiments is conducted on both IJB datasets. Rank-1 and Rank-5 accuracy are reported as the face identification metrics (higher is better). The referenced and reimplemented results are all listed in <Table 7>.
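Rank-k accuracy can be sketched directly from the probe-versus-gallery score matrix: a probe counts as correct if its true gallery identity appears among the k highest-scoring gallery entries (k = 1 and k = 5 for the tables above):

```python
import numpy as np

def rank_k_accuracy(score_matrix: np.ndarray, true_ids, k: int = 1) -> float:
    """Fraction of probes whose true gallery identity appears among the
    k highest-scoring gallery entries of its row."""
    order = np.argsort(-score_matrix, axis=1)  # gallery indices, best first
    hits = [true_ids[i] in order[i, :k] for i in range(len(true_ids))]
    return float(np.mean(hits))
```

Rank-5 is never lower than Rank-1 on the same scores, since enlarging k only adds candidates to the accepted set.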
As seen from <Table 5>, <Table 6>, and <Table 7>, generally speaking, more trainable parameters mean higher accuracy but also higher computational complexity and larger model size. Meanwhile, we can also find some small yet refined models, such as MobileFaceNet, ELANet, and MixFaceNet.
4.3 Performance vs. the Number of Parameters
This section plots the number of parameters against the performance of the lightweight models to present the trade-off between the two. As shown in <Figure 2>, the x-axes of the subplots from (a) to (i) represent LFW (accuracy), CALFW (accuracy), CPLFW (accuracy), CFP-FF (accuracy), CFP-FP (accuracy), AgeDB-30 (accuracy), VGG2-FP (accuracy), IJB-C (TAR at FAR=1e-4), and IJB-B (TAR at FAR=1e-4), respectively, and the y-axis is the number of parameters. Different methods are highlighted with different markers and colors. Models that achieve high performance with fewer parameters indicate a better trade-off, and we find that MobileFaceNet and MixFaceNet report a much better trade-off.
5. Challenge and Future Work
Although lightweight FR has achieved remarkable performance, many challenges remain. In our opinion, one challenge is how to design small yet refined lightweight models that keep an optimal trade-off between performance and efficiency. Another is how to handle diverse training and test datasets that include large facial poses, extreme expressions, occlusion, facial scale variation, motion blur, low illumination, and large scale. Finally, the interpretability of lightweight models is also another worthy challenge.
Our future work will therefore focus on addressing the above challenges. For designing lightweight models, we will consider manually designing architectures or using automatic machine learning methods such as NAS to search for networks. For diverse training and test datasets, we will consider cross-age, cross-pose, cross-race settings, and so on. For the interpretability of lightweight models, we mainly consider explanation in the spatial and scale dimensions.
6. Discussion
In this survey, we comprehensively reviewed recent lightweight models for face recognition. Firstly, a standard pipeline of an automatic lightweight FR system was presented. Secondly, we categorized the listed lightweight models into ADLM, PM, ANND_NAS, KD, and LRD according to their design modes. Thirdly, we introduced SqueezeFaceNet and EfficientFaceNet by pruning SqueezeNet and EfficientNet. Fourthly, we reimplemented EfficientFaceNet, SqueezeFaceNet, MixFaceNet, ShuffleFaceNet0.5×, MobileFaceNet, and LightCNN-9 to eliminate the influence of different computing resources, training datasets, and hyper-parameter settings. These results can be seen as a benchmark for comparison and direct reference. This survey is intended to inspire future work and indicates that future models should be small and refined, handle diverse datasets, and be interpretable.