CN112633416A - Brain CT image classification method fusing multi-scale superpixels - Google Patents
- Publication number: CN112633416A (application CN202110058684.0A)
- Authority: CN (China)
- Prior art keywords: scale, brain, image, classification, superpixel
- Prior art date: 2021-01-16
- Legal status: Pending (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G06F18/2453—Pattern recognition; classification techniques relating to the decision surface, non-linear, e.g. polynomial classifier
- G06F18/253—Pattern recognition; fusion techniques of extracted features
- G06N3/045—Neural networks; architecture; combinations of networks
- G06N3/08—Neural networks; learning methods
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
- G06T7/10—Image analysis; segmentation; edge detection
- G06T2207/10081—Image acquisition modality; computed x-ray tomography [CT]
Abstract
A brain CT image classification method fusing multi-scale superpixels belongs to the field of medical image research. The method has the following characteristics: 1) by fusing the multi-scale superpixels with the brain CT image, redundant image information is removed and the gray-level similarity between the lesion and surrounding brain-tissue pixels is reduced; 2) a multi-scale superpixel encoder based on regions and boundaries is designed, which effectively extracts the low-level lesion information contained in the multi-scale superpixels; 3) a fusion model of multi-scale superpixel features is designed, which comprehensively utilizes the high-level features extracted by a residual neural network and the low-level features of the multi-scale superpixels to classify brain CT; 4) compared with traditional deep learning algorithms, the method effectively exploits the lesion information contained in the multi-scale superpixels and thus classifies the diseases present in brain CT images more accurately. The method is reasonable and reliable, and can provide powerful help for the classification of brain CT images.
Description
Technical Field
The invention belongs to the field of medical image research, and particularly relates to a brain CT image classification method fusing multi-scale superpixels.
Background
The diagnosis of brain injury in clinical emergencies is extremely urgent, and even a short delay may worsen the patient's condition. Computed Tomography (CT) is one of the most commonly used diagnostic tools, characterized by fast imaging, low cost, wide applicability, and a high lesion detection rate. Although brain CT can detect critical and time-sensitive abnormalities such as intracranial hemorrhage, raised intracranial pressure, and skull fracture, conventional disease classification usually requires a radiologist to visually assess information such as the size of the hemorrhage area and to estimate the midline shift, which is a relatively time-consuming process. In recent years, with the progress of medical imaging technology, the number of brain CT images has grown geometrically, while the number of radiologists grows comparatively slowly and the cost and time of training a qualified radiologist are high; the radiologist's daily workload therefore keeps increasing, indirectly aggravating the social problem of access to medical care. An automatic brain CT classification method can thus assist radiologists, improve diagnostic efficiency, and reduce the misdiagnosis and missed-diagnosis rates, which has very important practical significance.
In recent years, the great success of Deep Learning (DL) in computer vision has also driven the rapid development of medical image analysis. The Convolutional Neural Network (CNN), a classical deep learning model, can capture local region information of an image and extract high-level semantic features, and is widely used for image feature extraction and classification tasks. CNNs are widely applied in medical image recognition and processing, and through continuous iterative optimization, many CNN-based classifiers have been constructed.
However, when using a conventional CNN to extract brain CT image features, existing work does not consider the differences between brain CT images and natural optical images. First, brain CT images have low spatial resolution and low contrast; they lack easily recognized natural visual features such as brightness, color, and texture; boundaries between regions are unclear and texture differences are small; images can vary significantly with patient and imaging position; and the images contain many unstable factors, such as displacement artifacts, partial-volume-effect errors, and equipment noise. In addition, medical images are mostly private hospital data, privacy-protection rules hinder their sharing, and the size of the data set directly affects the deep learning effect. Research shows that unsupervised superpixels are composed of pixels with similar characteristics, preserve the local details of the original image, and highlight local features, which benefits the extraction and expression of high-level image features; moreover, the number of superpixels is far smaller than the number of pixels of the original image, so using superpixels instead of pixels as the primitives of image processing greatly reduces the computational complexity of subsequent processing and improves the efficiency of image-processing algorithms.
Disclosure of Invention
Aiming at the problem that existing methods ignore the visual characteristics of brain CT images, the invention provides a brain CT image classification method fusing multi-scale superpixels (MSF). By optimizing the image with multi-scale superpixels, the method extracts lesion-area information in an unsupervised manner, thereby enhancing the expressiveness of the features generated by the residual neural network and improving the accuracy of the classification task.
To achieve this purpose, the technical scheme adopted by the invention is a brain CT image classification method fusing multi-scale superpixels. The process of the invention is shown in FIG. 1 and includes the following steps: first, construct a data set and preprocess it to obtain multi-scale superpixels; second, perform data enhancement through multi-scale superpixel image fusion to obtain an optimized fused image; then, apply a multi-scale superpixel feature coding algorithm based on region and boundary information to obtain the multi-scale superpixel low-level features; finally, classify the brain CT images with the multi-scale superpixel feature fusion classification model;
Step (1), acquiring data and preprocessing:
Step (1.1) data acquisition: acquire brain CT images to construct a data set. Each patient's data contains the RGB matrix I ∈ R^(N×N×3) generated from the brain CT image and a brain CT classification label vector Y = [Y_1, Y_2, …, Y_T], Y_i ∈ {0, 1}, where N represents the image pixel size and T represents the number of acquired disease categories.
Step (1.2): divide all patient data into a training set, a validation set, and a test set. The training set is used to learn the parameters of the neural network; the validation set is used to determine the network structure and hyper-parameters; the test set is used to verify the classification effect of the neural network.
Step (1.3) data preprocessing: based on the Super Hierarchy segmentation algorithm (SH), for a given brain CT image I and a set of superpixel segmentation scales {scale_1, scale_2, …, scale_S}, where S represents the number of set scales, the superpixel map P_s at the s-th segmentation scale is generated as follows:
P_s = SH(I, scale_s)
where s ∈ {1, 2, …, S} and scale_s is the s-th segmentation scale. Computing every segmentation scale yields P = {P_1, P_2, …, P_S}, the multi-scale superpixel comprising S superpixel maps of different scales.
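For illustration only, the following sketch produces one label map per scale. The SH algorithm itself is not given in code form in the patent, so SLIC from scikit-image is used here as a hedged stand-in superpixel generator:

```python
import numpy as np
from skimage.segmentation import slic

def multi_scale_superpixels(image: np.ndarray, scales=(5, 10, 15)):
    """Generate one superpixel label map P_s per segmentation scale.

    slic() is an assumption standing in for the patent's SH algorithm;
    each returned map assigns every pixel an integer superpixel id.
    """
    return [slic(image, n_segments=scale_s, start_label=0)  # P_s, shape (N, N)
            for scale_s in scales]
```

Given an (N, N, 3) RGB slice, multi_scale_superpixels(image) returns [P_1, P_2, P_3] for the example scales {5, 10, 15} used in the detailed description below.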
Step (2), multi-scale superpixel image fusion model: for a given brain CT image I and its multi-scale superpixel P = {P_1, P_2, …, P_S}, the fused image I' is calculated as follows:
I' = I ⊙ Σ_{s=1}^{S} f(W)_s · P_s
where ⊙ denotes the dot (element-wise) product, f(·) is the SoftMax function, W ∈ R^S represents the trainable weight, P_s represents the s-th element of P, and W realizes the adaptive distribution of the proportion of each scale.
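As a minimal PyTorch sketch of the fusion formula as reconstructed above (the softmax-weighted sum over scales is inferred from the symbol explanations, not the patent's verbatim code):

```python
import torch
import torch.nn as nn

class MultiScaleFusion(nn.Module):
    """Softmax-weighted fusion of S superpixel maps with the CT image."""

    def __init__(self, num_scales: int):
        super().__init__()
        self.W = nn.Parameter(torch.zeros(num_scales))  # trainable weight W

    def forward(self, image: torch.Tensor, maps: torch.Tensor) -> torch.Tensor:
        # image: (B, C, N, N); maps: (B, S, N, N), one channel per scale
        w = torch.softmax(self.W, dim=0)                       # f(W)
        mixed = (w.view(1, -1, 1, 1) * maps).sum(1, keepdim=True)
        return image * mixed               # I' = I (.) sum_s f(W)_s P_s
```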
Step (3), multi-scale superpixel feature coding:
step (3.1) for multi-scale superpixel of brain CT image IScale of middle divisionsSuper pixel map ofGenerating a set of pixel values thereofTo pairGenerates a set of mapping matrices for each pixel in the imageWherein the kth mapping matrixMiddle i, j elementThe calculation method of (c) is as follows:
wherein k ∈ {1, 2, … scalesTherein ofRepresents PsPixel value of the kth super pixel, Ms,kRepresenting a pixel value ofIs mapped to the super pixel area.
Step (3.2): based on region and boundary information, encode each mapping matrix in the set M_s to obtain the encoding result b_s of superpixel map P_s. The calculation process is as follows:
b_s = Σ_{k=1}^{scale_s} (s_k / N²) · M_{s,k}
where N² represents the number of pixels contained in the superpixel map and s_k represents the number of pixels contained in the k-th superpixel region.
Step (3.3): repeat step (3.1) and step (3.2) for each superpixel map in P, sequentially generating the encoding results b_1, b_2, …, b_S, and splice them into a matrix B = [b_1, b_2, …, b_S], obtaining the multi-scale superpixel feature coding B of P.
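A hedged sketch of steps (3.1) to (3.3) follows. Since the exact region-and-boundary formula is not fully recoverable from the source text, the encoding below assigns each pixel the normalized area s_k/N² of its superpixel region, which is equivalent to summing (s_k/N²)·M_{s,k} over k without materializing the mapping matrices:

```python
import numpy as np

def encode_superpixel_map(label_map: np.ndarray) -> np.ndarray:
    """Encode one superpixel map P_s into b_s (steps 3.1-3.2, reconstructed).

    Each pixel receives s_k / N**2, the normalized area of the superpixel
    region containing it; region boundaries survive as value transitions.
    """
    n_pixels = label_map.size                 # N^2
    areas = np.bincount(label_map.ravel())    # s_k for every region k
    return areas[label_map] / n_pixels        # b_s, shape (N, N)

def encode_multi_scale(label_maps) -> np.ndarray:
    """Stack b_1..b_S into the low-level feature B of step (3.3)."""
    return np.stack([encode_superpixel_map(p) for p in label_maps])
```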
Step (4), multi-scale superpixel feature fusion classification model:
step (4.1) constructing a Residual neural Network (ResNet) as a main Network, inputting the brain CT fusion image I' extracted in step (2), and selecting ResNetFeature activation output l of last residual structure (Basic Block) of four layers1、l2、l3、l4As a high-level feature.
Step (4.2): perform dimensionality reduction on the low-level feature B extracted in step (3.3), generating the feature f_0 through a convolution layer formed by 256 convolution kernels of size 3×3. Fuse f_0 with l_1, l_2, l_3, l_4 in sequence from bottom to top in a multi-layer manner, generating the fusion features f_1, f_2, f_3, f_4, where f_i (i ∈ {1, 2, 3, 4}) is calculated as follows: first, a pooling operation converts f_{i-1} into a feature matrix of the same size as l_i; then a convolution layer formed by 256 1×1 convolution kernels converts the channel number of l_i to 256; finally, feature fusion by matrix addition yields the fusion feature f_i of the i-th layer.
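A sketch of the step (4.2) fusion under stated assumptions: the stage channel counts (64, 128, 256, 512) assume a ResNet-34 BasicBlock backbone, and S = 3 encoded scale maps form the low-level input; the module names and layout are illustrative, not the patent's code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeatureFusion(nn.Module):
    """Bottom-up fusion of the low-level feature B with ResNet stages."""

    def __init__(self, num_scales: int = 3,
                 stage_channels=(64, 128, 256, 512)):
        super().__init__()
        # B -> f_0: 256 kernels of size 3x3, as in step (4.2)
        self.entry = nn.Conv2d(num_scales, 256, kernel_size=3, padding=1)
        # one 1x1 lateral conv per stage, mapping l_i to 256 channels
        self.lateral = nn.ModuleList(
            nn.Conv2d(c, 256, kernel_size=1) for c in stage_channels)

    def forward(self, low_level: torch.Tensor, stages) -> torch.Tensor:
        f = self.entry(low_level)                       # f_0
        for conv, l in zip(self.lateral, stages):       # l_1 .. l_4
            f = F.adaptive_avg_pool2d(f, l.shape[-2:])  # pool f_{i-1} to l_i size
            f = f + conv(l)                             # matrix addition -> f_i
        return f                                        # f_4
```

Adaptive average pooling is used here so that f_{i-1} matches l_i's spatial size regardless of input resolution; the patent specifies only "a pooling operation", so this is one reasonable choice.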
Step (4.3): input the obtained fusion feature f_4 into a convolution layer formed by 512 convolution kernels, a pooling layer, and a fully connected layer, obtaining a classification vector x of the same length as the number of labels T. From x, generate the predicted-value vector y = [y_1, y_2, …, y_T] through Sigmoid regression, where y_i ∈ [0, 1], generated from the i-th element of x, is the probability that the corresponding label is a positive example: y_i = Sigmoid(x_i). The classification result is determined by the set classification threshold t: when y_i is greater than t, the model judges that the disease of the corresponding label is present in the brain CT; if y_i is less than t, it is judged normal. Here t = 0.5.
Step (4.4): input the patient brain CT image I and the brain disease classification label Y into the brain CT image classification method fusing multi-scale superpixels to obtain the probability y of the subject belonging to each category. Given a data set of M patients D = {(I_1, Y_1), (I_2, Y_2), …, (I_M, Y_M)}, for a given brain CT image I_i with corresponding label Y_i and the label prediction y_i generated by the model, the classification error of each label in a sample is calculated by the binary cross-entropy loss (BCE loss), and the sample error is obtained by averaging all label classification errors. The loss function is calculated as follows:
Loss = -(1/T) · Σ_{j=1}^{T} [Y_j · log(y_j) + (1 - Y_j) · log(1 - y_j)]
where Y_j represents the value of the j-th label of the sample, y_j represents the value of the j-th label predicted by the model, and T represents the number of labels in the sample.
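The per-sample loss of step (4.4) is the standard multi-label binary cross-entropy averaged over the T labels; a minimal PyTorch sketch (the shapes are illustrative):

```python
import torch
import torch.nn as nn

criterion = nn.BCELoss()  # averages the per-label BCE terms

x = torch.randn(1, 14)                    # classification vector, T = 14
y = torch.sigmoid(x)                      # y_i = Sigmoid(x_i)
Y = torch.randint(0, 2, (1, 14)).float()  # ground-truth label vector

loss = criterion(y, Y)                    # BCE loss of step (4.4)
pred = (y > 0.5).int()                    # classification threshold t = 0.5
```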
Step (4.5): for the training set in step (1.2), minimize the loss function in step (4.4) with the Adam adaptive optimization algorithm, and observe the classification accuracy of the model on the validation set after training under different learning rates λ. In general the initial value of λ is set between 10^-6 and 10^-3, each subsequent learning rate is set to 3 times the previous one, and the maximum value of λ is set between 0.1 and 0.5; the learning rate with the highest accuracy is then selected to train the model.
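A sketch of the step (4.5) learning-rate search under the stated schedule; evaluate(lr) is a caller-supplied, hypothetical function that trains the model at one rate and returns validation accuracy:

```python
def search_learning_rate(evaluate, initial=1e-5, factor=3.0, maximum=0.1):
    """Return the learning rate with the highest validation accuracy."""
    lr, best_lr, best_acc = initial, initial, float("-inf")
    while lr <= maximum:
        acc = evaluate(lr)          # train with Adam at rate lr, validate
        if acc > best_acc:
            best_lr, best_acc = lr, acc
        lr *= factor                # next rate is 3x the previous one
    return best_lr
```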
After all the above steps are completed, a new brain CT data set can be input into the model, and the brain CT images are classified according to the prediction results output by the model.
Compared with the prior art, the method has the following obvious advantages and beneficial effects:
The invention provides a brain CT image classification method fusing multi-scale superpixels, which has the following characteristics compared with traditional image classification networks: 1) by fusing the multi-scale superpixels with the brain CT image, redundant image information is removed and the gray-level similarity between the lesion and surrounding brain-tissue pixels is reduced; 2) a multi-scale superpixel encoder based on regions and boundaries is designed, which effectively extracts the low-level lesion information contained in the multi-scale superpixels; 3) a fusion model of multi-scale superpixel features is designed, which comprehensively utilizes the high-level features extracted by the residual neural network and the low-level features of the multi-scale superpixels to classify brain CT; 4) compared with traditional deep learning algorithms, the method effectively exploits the lesion information contained in the multi-scale superpixels and thus classifies the diseases present in brain CT images more accurately.
Drawings
FIG. 1: flow chart of the brain CT image classification method fusing multi-scale superpixels.
FIG. 2: the multi-scale superpixel feature fusion classification model.
FIG. 3: the fusion model of features of different levels.
FIG. 4: visualization of multi-scale superpixel brain CT image fusion.
Detailed Description
In the present embodiment, the study objects are cerebral hemorrhage patients, but the method is not limited thereto; brain CT images of patients with other brain diseases may equally serve as study objects. Taking a real cerebral hemorrhage CT data set as an example, the implementation steps of the method are described in detail below:
Step (1), acquiring data and preprocessing:
step (1.1) data: the invention uses CQ500 data set (http:// headctstudy. qure. ai/data set) to collect brain CT image and construct data set, practically obtains 451 scan data, total 22773 brain CT images, each patient label information contains 14 diagnosis categories of brain diseases: intracranial hemorrhage, cerebral parenchymal hemorrhage, ventricular hemorrhage, subdural hemorrhage, epidural hemorrhage, subarachnoid hemorrhage, left-side cerebral hemorrhage, right-side cerebral hemorrhage, chronic hemorrhage, fracture, skull fracture, other fractures, midline shift, mass effects. First, for the label of the data set, since three radiologists may have different labels for the same label, we select the choice of most experts as the true label for the case of non-uniform labels. Then, according to the determined intracranial hemorrhage label information, the method willThe data set is divided into 204 cases of patients with confirmed cerebral hemorrhage and 247 cases of patients with undetermined cerebral hemorrhage, 742 images corresponding to the lesion positions are selected from the cases of patients with confirmed cerebral hemorrhage according to the diagnosis type labels, 1045 images at the same positions as the lesion positions in the cases of patients with cerebral hemorrhage are selected from the cases of patients with undetermined hemorrhage, and M is 1787 brain CT images in total to serve as the data set D { (I)1,Y1),(I2,Y2),…,(I1787,Y1787)}. Each data contains RGB matrix generated by CT image of brainAnd T ═ 14 brain disease diagnosis label vector Y ═ Y1,Y2,…Y14],YiE {0, 1}, where element Y of the tag i1 indicates the presence of a brain disease corresponding to the i-th label in the brain CT image, and YiWhen 0, it is normal.
Step (1.2): divide all patient data into a training set, a validation set, and a test set in the ratio 8:1:1. The training set is used to learn the parameters of the neural network; the validation set is used to determine the network structure and hyper-parameters; the test set is used to verify the classification effect of the neural network.
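A minimal sketch of the patient-level 8:1:1 split (the seed and helper name are illustrative choices, not from the patent):

```python
import random

def split_patients(patients, seed=0):
    """Split patient records 8:1:1 into train/validation/test sets."""
    patients = list(patients)
    random.Random(seed).shuffle(patients)
    n_train = int(0.8 * len(patients))
    n_val = int(0.1 * len(patients))
    return (patients[:n_train],                 # training set
            patients[n_train:n_train + n_val],  # validation set
            patients[n_train + n_val:])         # test set
```

Splitting at the patient level keeps all slices of one scan in the same partition.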
Step (1.3) data preprocessing: based on the Super Hierarchy segmentation algorithm (SH), for the given brain CT image I and the set of S = 3 superpixel segmentation scales {5, 10, 15}, the superpixel maps are generated as follows:
P_1 = SH(I, 5)
P_2 = SH(I, 10)
P_3 = SH(I, 15)
yielding the multi-scale superpixel P = {P_1, P_2, P_3}.
Step (2), multi-scale superpixel image fusion model: for the given brain CT image I and its multi-scale superpixel P = {P_1, P_2, P_3}, the fused image I' is calculated as follows:
I' = I ⊙ Σ_{s=1}^{3} f(W)_s · P_s
where ⊙ denotes the dot (element-wise) product, f(·) is the SoftMax function, W ∈ R³ represents the trainable weight, P_s represents the s-th element of P, and W realizes the adaptive distribution of the proportion of each scale.
Step (3), multi-scale superpixel feature coding:
step (3.1) for multi-scale superpixel of brain CT image ISuper pixel map with a median segmentation scale of 5Wherein scale1Generating its set of pixel values 5To pairGenerates a set of mapping matrices M for each pixel1={M1,1,M1,2,…,M1,5Wherein the kth mapping matrixMiddle i, j elementThe calculation method of (c) is as follows:
where k ∈ {1, 2, … 5}, whereRepresents P1Pixel value of the kth super pixel, M1,kRepresenting a pixel value ofSuper pixel area mapping of
Step (3.2): based on region and boundary information, encode each mapping matrix in the set M_1 to obtain the encoding result b_1 of superpixel map P_1. The calculation process is as follows:
b_1 = Σ_{k=1}^{5} (s_k / N²) · M_{1,k}
where s_k represents the number of pixels contained in the k-th superpixel region.
Step (3.3): repeat step (3.1) and step (3.2) for each superpixel map in P, sequentially generating the encoding results b_1, b_2, b_3, and splice them into the matrix B = [b_1, b_2, b_3], obtaining the multi-scale superpixel feature coding B of P.
Step (4), multi-scale superpixel feature fusion classification model:
step (4.1) constructing a 34-Layer residual neural network ResNet-34 as a main network, using the brain CT fusion image I' input extracted in step (2), and selecting the feature activation output of the last residual structure (Basic Block) of four layers in ResNet-34 As a high-level feature.
Step (4.2): perform dimensionality reduction on the low-level feature B extracted in step (3.3), generating the feature f_0 through a convolution layer formed by 256 convolution kernels of size 3×3. Fuse f_0 with l_1, l_2, l_3, l_4 in sequence from bottom to top in a multi-layer manner, generating the fusion features f_1, f_2, f_3, f_4, where f_i (i ∈ {1, 2, 3, 4}) is calculated as follows: first, a pooling operation converts f_{i-1} into a feature matrix of the same size as l_i; then a convolution layer formed by 256 1×1 convolution kernels converts the channel number of l_i to 256; finally, feature fusion by matrix addition yields the fusion feature f_i of the i-th layer.
Step (4.3): input the obtained fusion feature f_4 into a convolution layer formed by 512 convolution kernels, a pooling layer, and a fully connected layer, obtaining a classification vector x of the same length as the number of labels T. From x, generate the predicted-value vector y = [y_1, y_2, …, y_14] through Sigmoid regression, where y_i ∈ [0, 1], generated from the i-th element of x, is the probability that the corresponding label is a positive example: y_i = Sigmoid(x_i). With the classification threshold set to t = 0.5, when y_i is greater than 0.5 the model judges that the disease of the corresponding label is present in the brain CT, and if y_i is less than 0.5 it is judged normal.
Step (4.4): input the patient brain CT image I and the brain disease classification label Y into the brain CT image classification method fusing multi-scale superpixels to obtain the probability y of the subject belonging to each category. For an input brain CT image I with corresponding label Y and the label prediction y generated by the model, the classification error of each label in a sample is calculated by the binary cross-entropy loss (BCE loss), and the sample error is obtained by averaging the classification errors of all labels. The loss function is calculated as follows:
Loss = -(1/T) · Σ_{j=1}^{T} [Y_j · log(y_j) + (1 - Y_j) · log(1 - y_j)]
step (4.5) aiming at the training set in step (1.2), minimizing the loss function in step (4.4) by utilizing Adam adaptive optimization algorithm, observing the classification accuracy rate of the model on the verification set after the model is trained by the training set under different learning rates lambda, and setting the lambda initial value to be 10-5The next learning rate is set to be 3 times of the last time, the maximum lambda value is set to be 0.1, and then the learning rate with the highest accuracy is selected to train the model.
After all the above steps are completed, a new brain CT data set can be input into the model, and the brain CT images are classified according to the prediction results output by the model.
To illustrate the beneficial effects of the method of the invention, in the implementation we compare against the traditional classification model ResNet-34, which serves as the backbone network herein, and perform ablation experiments on the three parts of the brain CT classification model fusing multi-scale superpixels (superpixel image fusion, the superpixel feature coding algorithm, and the different-level feature fusion model) to verify the effectiveness of each part. The experiments adopt the widely used Accuracy (ACC), Sensitivity (SEN), and F1 score (F) as evaluation indexes; the experimental results are shown in Table 1.
The multi-scale superpixel selection uses two scale combinations of superpixel maps, {5, 10, 15} and {10, 15, 20}. The experimental control group Baseline classifies using the brain CT images alone; MSF denotes the method of the invention; MSF_001 denotes using only the feature fusion model, with the original brain CT image as input; MSF_011 denotes the MSF method with the multi-scale superpixel image fusion part removed; MSF_101 denotes the MSF method with the superpixel feature coding part removed; MSF_100 denotes the MSF method with both the multi-scale superpixel image fusion and superpixel feature coding parts removed.
TABLE 1: comparison experiments on the brain CT classification model fusing multi-scale superpixels
Comparing the classification effect of MSF_011 with MSF shows that, without multi-scale superpixel brain CT image fusion, the classification effect is slightly improved over Baseline but still inferior to the full MSF method. Comparing the classification results of MSF_101 with MSF shows that although the classification effect without the superpixel feature coding algorithm is clearly improved over Baseline, the multi-scale superpixel map used directly, without reference to region and boundary information, expresses poorly and cannot serve directly as a low-level feature; its fusion with the high-level features works poorly and differs considerably from the MSF classification effect. The two comparison groups, MSF_001 versus Baseline and MSF_100 versus MSF_101, show that whether the input is the original image or the multi-scale superpixel fused image, using the feature fusion model improves the classification effect markedly over extracting features with ResNet alone.
In addition, the weight map of the multi-scale superpixels and weights in the multi-scale superpixel fusion process and the multi-scale superpixel fusion image I' are visualized, as shown in FIG. 4. Clearly, the weight map removes the redundant information outside the cranium well and marks the lesion area accurately without over-segmentation; the fused image I' distinguishes the lesion area clearly, reduces the gray-level similarity between the lesion and surrounding brain-tissue pixels, and expresses the lesion area better.
In conclusion, the comparison with the Baseline method and the ablation experiments verify the effectiveness of the MSF method proposed herein for the brain CT image classification task. The fused image generated by the multi-scale superpixel brain CT image fusion model effectively reduces image noise and delineates the lesion area accurately; the multi-scale superpixel encoder extracts accurate low-level features containing region area and boundary information, allowing small-area lesions to receive more attention; and the feature fusion model generates more discriminative fusion features by fusing features of two different levels, expressing lesion regions of variable area more effectively. The method is therefore reasonable and reliable, and can provide powerful help for brain CT image classification.
Claims (1)
1. A brain CT image classification method fusing multi-scale superpixels, characterized by comprising the following steps: first, constructing a data set and preprocessing it to obtain multi-scale superpixels; second, performing data enhancement through multi-scale superpixel image fusion to obtain an optimized fused image; then, processing the multi-scale superpixels with a feature coding algorithm based on region and boundary information to obtain the multi-scale superpixel low-level features; finally, classifying the brain CT images with the multi-scale superpixel feature fusion classification model;
step (1), acquiring data and preprocessing:
step (1.1) data acquisition: acquiring brain CT images to construct a data set, each patient's data comprising the RGB matrix I ∈ R^(N×N×3) generated from the brain CT images and a brain CT classification label vector Y = [Y_1, Y_2, …, Y_T], Y_i ∈ {0, 1}, where R represents the real number set, N represents the image pixel size, and T represents the number of acquired disease categories;
step (1.2): dividing all patient data into a training set, a validation set, and a test set, wherein the training set is used for learning the parameters of the neural network, the validation set is used for determining the hyper-parameters, and the test set is used for verifying the classification effect of the neural network;
step (1.3) data preprocessing: based on the Super Hierarchy segmentation algorithm (SH), for a given brain CT image I and a set of superpixel segmentation scales {scale_1, scale_2, …, scale_S}, where S represents the number of set scales, generating the superpixel map P_s at the s-th segmentation scale as follows:
P_s = SH(I, scale_s)
where s ∈ {1, 2, …, S} and scale_s is the s-th segmentation scale; computing every segmentation scale yields P = {P_1, P_2, …, P_S}, the multi-scale superpixel comprising S superpixel maps of different scales;
step (2), multi-scale superpixel image fusion model: for the given brain CT image I and its multi-scale superpixel P = {P_1, P_2, …, P_S}, calculating the fused image I' as follows:
I' = I ⊙ Σ_{s=1}^{S} f(W)_s · P_s
where ⊙ denotes the dot (element-wise) product, f(·) is the SoftMax function, W ∈ R^S represents the trainable weight, P_s represents the s-th element of P, and W realizes the adaptive distribution of the proportion of each scale;
step (3), multi-scale superpixel feature coding:
step (3.1): for the superpixel map P_s with segmentation scale scale_s in the multi-scale superpixel P of brain CT image I, generating its set of pixel values V_s = {v_{s,1}, v_{s,2}, …, v_{s,scale_s}}; for the pixels of P_s, generating a set of mapping matrices M_s = {M_{s,1}, M_{s,2}, …, M_{s,scale_s}}, where the element m^{s,k}_{i,j} at position (i, j) of the k-th mapping matrix M_{s,k} is calculated as follows:
m^{s,k}_{i,j} = 1 if P_s(i, j) = v_{s,k}, and m^{s,k}_{i,j} = 0 otherwise
where k ∈ {1, 2, …, scale_s}, v_{s,k} represents the pixel value of the k-th superpixel of P_s, and M_{s,k} represents the mapping of the superpixel region whose pixel value is v_{s,k};
step (3.2): based on region and boundary information, encoding each mapping matrix in the set M_s to obtain the encoding result b_s of superpixel map P_s, calculated as follows:
b_s = Σ_{k=1}^{scale_s} (s_k / N²) · M_{s,k}
where N² represents the number of pixels contained in the superpixel map and s_k represents the number of pixels contained in the k-th superpixel region;
step (3.3): repeating step (3.1) and step (3.2) for each superpixel map in P, sequentially generating the encoding results b_1, b_2, …, b_S, and splicing them into a matrix B = [b_1, b_2, …, b_S], obtaining the multi-scale superpixel feature coding B of P;
step (4), multi-scale superpixel feature fusion classification model:
step (4.1): constructing a residual neural network ResNet as the backbone network, inputting the brain CT fusion image I' extracted in step (2), and selecting the feature activation outputs l_1, l_2, l_3, l_4 of the last residual structure (Basic Block) of each of the four layers in ResNet as high-level features;
step (4.2): performing dimensionality reduction on the low-level feature B extracted in step (3.3), generating the feature f_0 through a convolution layer formed by 256 convolution kernels of size 3×3; fusing f_0 with l_1, l_2, l_3, l_4 in sequence from bottom to top in a multi-layer manner, generating the fusion features f_1, f_2, f_3, f_4, where f_i (i ∈ {1, 2, 3, 4}) is calculated as follows: first, a pooling operation converts f_{i-1} into a feature matrix of the same size as l_i; then a convolution layer formed by 256 1×1 convolution kernels converts the channel number of l_i to 256; finally, feature fusion by matrix addition yields the fusion feature f_i of the i-th layer;
step (4.3): inputting the obtained fusion feature f_4 into a convolution layer formed by 512 convolution kernels, a pooling layer, and a fully connected layer, obtaining a classification vector x of the same length as the number of labels T; generating from x the predicted-value vector y = [y_1, y_2, …, y_T] through Sigmoid regression, where y_i ∈ [0, 1], generated from the i-th element of x, is the probability that the corresponding label is a positive example: y_i = Sigmoid(x_i); determining the classification result by the set classification threshold t: when y_i is greater than t, the model judges that the disease of the corresponding label is present in the brain CT; if y_i is less than t, it is judged normal; t = 0.5;
step (4.4): inputting the patient brain CT image I and the brain disease classification label Y, and obtaining the probability y of the subject belonging to each category; given a data set of M patients D = {(I_1, Y_1), (I_2, Y_2), …, (I_M, Y_M)}, for a given brain CT image I_i with corresponding label Y_i and the label prediction y_i generated by the model, calculating the classification error of each label in the sample by the binary cross-entropy loss, and obtaining the sample error by averaging all label classification errors; the loss function is calculated as follows:
Loss = -(1/T) · Σ_{j=1}^{T} [Y_j · log(y_j) + (1 - Y_j) · log(1 - y_j)]
where Y_j represents the value of the j-th label of the sample, y_j represents the value of the j-th label predicted by the model, and T represents the number of labels in the sample;
step (4.5): for the training set in step (1.2), minimizing the loss function in step (4.4) with the Adam adaptive optimization algorithm, and observing the classification accuracy of the model on the validation set after training under different learning rates λ; the initial value of λ is set to 10^-5, each subsequent learning rate is set to 3 times the previous one, and the maximum value of λ is set to 0.1; the learning rate with the highest accuracy is then selected to train the model;
step (5): after all the steps are completed, inputting a new brain CT data set into the model and classifying the brain CT images according to the prediction result output by the model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title
---|---|---|---|
CN202110058684.0A | 2021-01-16 | 2021-01-16 | Brain CT image classification method fusing multi-scale superpixels
Publications (1)
Publication Number | Publication Date
---|---|
CN112633416A | 2021-04-09
Family
ID=75294799
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title
---|---|---|---|---|
US20100172567A1 | 2007-04-17 | 2010-07-08 | Prokoski Francine J | System and method for using three dimensional infrared imaging to provide detailed anatomical structure maps
CN108268870A | 2018-01-29 | 2018-07-10 | 重庆理工大学 | Multi-scale feature fusion ultrasound image semantic segmentation method based on adversarial learning
CN110910404A | 2019-11-18 | 2020-03-24 | 西南交通大学 | Breast ultrasound nodule segmentation method for noisy data
Non-Patent Citations (2)
Title |
---|
康文杰 et al.: "Research progress of generative adversarial networks and their applications in neuroimaging", 医疗卫生装备, no. 09, 15 September 2020 (2020-09-15) |
王玉丽 et al.: "A review of deep learning-based brain image segmentation algorithms", 生物医学工程学杂志, no. 04, 31 December 2020 (2020-12-31) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113362347A (en) * | 2021-07-15 | 2021-09-07 | 广东工业大学 | Image defect region segmentation method and system based on multi-scale superpixel feature enhancement |
CN113707278A (en) * | 2021-08-30 | 2021-11-26 | 北京工业大学 | Brain CT medical report generation method based on spatial coding |
CN113707278B (en) * | 2021-08-30 | 2023-11-03 | 北京工业大学 | Brain CT medical report generation method based on spatial coding |
CN114141339A (en) * | 2022-01-26 | 2022-03-04 | 杭州未名信科科技有限公司 | Pathological image classification method, device, equipment and storage medium for membranous nephropathy |
CN114141339B (en) * | 2022-01-26 | 2022-08-05 | 杭州未名信科科技有限公司 | Pathological image classification method, device, equipment and storage medium for membranous nephropathy |
Legal Events
Code | Title
---|---|
PB01 | Publication
SE01 | Entry into force of request for substantive examination