Classification of the Traffic Status Subcategory with ETC Gantry Data: An Improved Support Tensor Machine Approach
Classification of the Traffic Status Subcategory with ETC Gantry Data: An Improved Support Tensor...
Zhao, Yan;Lu, Wenqi;Rui, Yikang;Ran, Bin
2023-03-30 00:00:00
Hindawi Journal of Advanced Transportation Volume 2023, Article ID 2765937, 21 pages https://doi.org/10.1155/2023/2765937 Research Article Classification of the Traffic Status Subcategory with ETC Gantry Data: An Improved Support Tensor Machine Approach 1,2 1,2 1,2 1,2 Yan Zhao, Wenqi Lu, Yikang Rui , and Bin Ran School of Transportation, Southeast University, Nanjing, Jiangsu 211189, China Joint Research Institute on Internet of Mobility, Southeast University and University of Wisconsin-Madison, Southeast University, Nanjing 211189, China Correspondence should be addressed to Yikang Rui; 101012189@seu.edu.cn Received 31 October 2022; Revised 17 February 2023; Accepted 1 March 2023; Published 30 March 2023 Academic Editor: Jinjun Tang Copyright © 2023 Yan Zhao et al. Tis is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Accurate and reliable trafc state identifcation is the prerequisite for developing intelligent trafc programs. With the im- provement of intelligent trafc control measures, the trafc state of some highways has gradually stabilized. Te current research on trafc state identifcation needs to fully meet the highly informative intelligent trafc system and trafc state subcategory analysis. To fll the gap above, we propose an improved support tensor machine (STM) method based on self-training and multiclassifcation for trafc state subcategory identifcation (ISTM) with ETC gantry data. Tis paper takes the excellent ap- plication of the support vector machine (SVM) in trafc state identifcation as the starting point of method design and extends to the STM. Te ETC gantry data are represented as a third-order tensor model. Tis paper utilizes the similarity among tensor samples to construct the kernel function and recognize the trafc states. We simplify STM calculation with a one-against-one model and a self-training idea. An optimal ft of the characteristics is supplied by maximizing inter-subcategory tensor block distances and minimizing intra-subcategory tensor block distances throughout a joint utilization of the STM and multiscale training theories. Te experiment in this paper uses ETC gantry data from the Jingtai highway in Shandong Province, and the fndings reveal that the ISTM has optimum values of 0.2578 and 0.3254 for the SumD and 0.1718 and 0.1901 for the DBI as compared to K-mean clustering and the SVM. Te ISTM trains the trafc state subcategory classifers with high accuracy and strong generalization ability. and provide an in-depth understanding of the traveler’s 1. Introduction needs. Te management can develop fne-grained trafc Practical and reliable trafc state identifcation is one of the management content for diferent objectives. Trafc operators critical components of the intelligent transportation system, can analyze and compare subcategories of each trafc state which is the foundation for the ITS system to improve and make the necessary control technology updates. For highway operations efciency and trafc control scheme example, Trafc operators can provide travel guidance to maximize road network efciency. It is easy to obtain trafc development. With the improvement of intelligent control measures, the trafc state of some highways gradually tends to information feedback and improve highways’ resilience after be smooth and less fuctuating. An efective trafc state the trafc conditions refnement. identifcation method helps to grasp the changes in the trafc At present, the criteria of highway trafc states could be state from the global perspective, which provides an essential divided into absolute and relative metrics [1]. Relative scientifc basis for trafc management and control. A fne- metrics use historical trafc data to describe the highway grained trafc state classifcation can explain the chaotic part trafc condition in a specifc time and space. Te relative of the trafc state and enable cognitive upgrading of trafc metrics of trafc state include fow rate, speed, and density. states. Te refned trafc state subcategories are more specifc Te relative metrics are usually divided into four categories, 2 Journal of Advanced Transportation data analysis [9]. Tis paper uses ETC gantry data’s rich including “free,” “smooth,” “congested,” and “blocked” [2]. Tese four trafc states are highly variable, they vary sig- dimensionality and fast update speed as the entry point to support trafc states subcategories parameter analysis. nifcantly in the granularity of the trafc fow parameters, and it is easy to classify them. Currently, the research on Tis paper proposes a trafc state subcategory identif- trafc state identifcation mainly focuses on the above four cation method based on the STM. Te highly generalized categories of trafc states. Yue et al. [3] classifed trafc states nature of the tensor model for a concise representation of into fve levels (free fow, near free fow, light congestion, multiple linear types in linear and dyadic spaces and has high moderate congestion, and severe congestion) based on the program readability. Data representation is a critical factor in vehicle kilometers traveled (VKT) for a specifc travel speed. model performance. Research shows that the third-order tensor model is superior to the matrix and fourth-order However, these fve levels still need to portray the charac- teristics of diferent trafc state subcategories fully and still tensor representations in preserving information in the data set [10]. Te STM has a good recognition efect when the belong to larger-grained trafc state categories. Researchers have started to study the vulnerability of trafc qualitatively samples are uneven, and the sample size is small. Terefore, the tensor model meets the model construction requirements or quantitatively [4]. Te highway trafc system is a complex time-varying system subject to more uncertain trafc events, of this study. Traditional STM has some limitations when and the trafc state pattern may be variable [5]. Te trafc faced with the problem of trafc state subcategory identif- state identifcation method based on the large granularity cation. First, the traditional STM is a binary classifcation classifcation cannot meet the needs of increasingly complex algorithm, which cannot deal with multicategory problems. and detailed trafc fow change research. Tere is a need to Second, the traditional STM is a supervised learning algo- refne the existing trafc state category and further deepen rithm, which requires a large amount of sample labeling work and increases the data processing workload. Finally, diferent the research on the trafc state identifcation method. It is a need for trafc state identifcation methods that can be tensor blocks contain diferent amounts of information. Traditional STM uses the same scale to express tensors and applied to efciently identify trafc state subcategories from a large amount of data. Tis paper aims to conduct research deals with fxed-size tensor objects, resulting in the need for more analysis of tensor blocks of diferent scales. on trafc state subcategory identifcation and design a method based on STM. Tis research makes some improvements to the STM. Data mining and machine learning methods strongly (1) An ETC gantry data three-order tensor model is drive trafc state research development [6, 7]. Trafc state constructed to deeply explore the relationship be- identifcation is afected by the quality of research data and tween multilevel parameters of ETC gantry data. model performance to some specifc—the trafc state sub- (2) A support tensor classifer based on self-training category research challenges the trafc data and models learning is proposed, and the improved STM is used. (1) Te collection of highway trafc data may rely on based on self-training and multiclassifcation (ISTM) commonly used detectors, such as magnetic detectors, video according to the recent experience of many appli- detectors, and mobile phone data [8]. Due to the limitation cations of the SVM in trafc state identifcation. At of detection ability (e.g., vehicle type detection and vehicle the same time, the radial basis tensor kernel function diversion detection) and low deployment density of the for the input tensor sample’s high-dimensional above detectors, the quality of trafc data obtained is difcult mapping to explore the correlation degree of dif- to guarantee. (2) Te data structure is the basis for building ferent trafc parameters. Te improvements made by the data model and the data operations based on the data the ISTM mainly include embedding the one- structure. Implementing ITS trafc data is the key to against-one mechanism to achieve multicategory building a trafc state subcategory identifcation model. classifcation. Te ISTM introduced the idea of self- Based on ensuring the model’s high interpretation, sensi- training to reduce the workload of sample calibration tivity, and stability, the model needs to dig into the mul- and extract diferent tensor blocks to represent timodal correlation between the data and learn trafc data’s multiscale tensor slices. spatial and temporal patterns under the trafc state subcategories. (3) Tis paper uses ETC gantry data for experimental Diferent standard trafc data collections are shown in comparison to verify the efectiveness of the ISTM. Tables 1 and 2. At present, ETC gantries are rapidly pop- Te main contributions of this article can be described as ularized and developed. Te ETC gantries system includes follows: static data (e.g. toll station information) and dynamic data (e.g. travel information and toll information). Tese data (i) Tis paper takes the small range of trafc state hide a large amount of valuable information. Te value of changes on the essential highway section as the ETC gantries data gradually manifests with the continuous research object, refnes the trafc state categories, development of big data technology. First, ETC gantry data and enriches the practical use meaning of trafc are updated quickly. ETC gantry could accumulate a huge state identifcation. data volume in a short period. Second, ETC gantry data have (ii) An ETC gantry data three-order tensor model- a high consistency structure, and the update overhead is based represents to preserve the spatialtemporal negligible. In addition, ETC gantry data are rich in di- information in the ETC gantry data. Te tensor mensions, which may provide various perspectives for trafc Journal of Advanced Transportation 3 Table 1: Comparison of trafc data collection methods. Collection method Advantages Disadvantages Te compatibility with trafc information collection systems based on other Less investment, not afected by external conditions such as climate and light, technologies could be better. When the vehicle speed is too fast or congested, Coil detector strong adaptability, high accuracy of speed measurement and trafc counting, the coil current does not change signifcantly, and the detection accuracy and good working stability. reduces Easily afected by the weather and the refection of sunlight, the energy refected Easy to install and maintain, the clear discrimination of models. Low energy from the vehicles in the detection area will be difused or absorbed by the Infrared detector consumption and no radiation impact on the surrounding environment. particles in the air, such as rain, snow and haze; thus afecting the detection Accurate measurement of vehicle position, speed and vehicle type. results. Te data detection accuracy of the detector is afected by the environment, especially the impact of high winds and rainstorms. Moreover, the detection High detection accuracy, and continuous detection of all weather, can also range of ultrasonic waves is tapered and is afected by the model and height of Ultrasonic detector detect stationary vehicles and trafc fow information under congestion. Te the vehicle; when the vehicle is at medium and high speeds, it generates a large detector is small, easy to install, and has a long service life. number of ultrasonic pulse repetitions that quickly make the measured occupancy small. It cannot accurately measure the speed of a single vehicle. On road sections with Easy to install and maintain, long service life, less afected by weather and more congested trafc, uneven distribution of vehicle types and more large Radar detector climate, and able to carry out multiple lane detection simultaneously. vehicles, the detection accuracy of the detector drops sharply due to severe occlusion. Relying on optical principles, factors such as dust, shadows, weather, and lighting can afect the detection results. Small vehicles may be obscured by Flexible in use, can detect multiple trafc parameters at once and has an accompanying large vehicles, resulting in inaccurate detection results. Te Video image detection extensive detection range. detection performance is afected when the vehicle and road contrast is low. Te equipment and image processing cost is high, and the real-time performance could be better. Te decrease in detection accuracy caused by the base station positioning error leads to the loss of short-distance travel within the base station cell. Low Wide coverage, large analysis samples, low implementation cost, and long-term signaling sampling frequency causes the trajectory tracking of cell phones to be Cell phone signaling data continuous monitoring. noncontinuous, making it difcult to know accurately the user’s departure time, arrival time, and dwell time—limitations caused by nonfull-sample detection. Global all-weather positioning with high positioning accuracy can provide Location is infuenced by climate, ionosphere, troposphere, air, and GPS data global unifed 3D geocentric coordinates as follows: short acquisition period, electromagnetic waves. Low sampling rates introduce sampling errors and long duration, and large data scale. positioning errors. Due to the evasion behaviours such as blocking license plates and unstable A high level of intelligent automation, wide coverage, fast update speed, and equipment working conditions, problems such as missing felds and abnormal ETC gantry data rich ETC gantry data dimensions can provide multiple perspectives for trafc data may occur in the ETC gantry data. Te trafc detection between ETC data analysis. High accuracy and good development prospects. gantry zones needs to be improved, and the quality of ETC gantry data cannot be guaranteed. 4 Journal of Advanced Transportation Table 2: Types of trafc data available to the detector. Collection method Trafc fow Occupancy Speed Vehicle type Other parameters Coil detector ✓ ✓ ✓ ✓ Vehicle length Infrared detector ✓ ✓ ✓ ✓ Queue length Ultrasonic detector ✓ × ✓ × — Radar detector ✓ ✓ ✓ × Headway Video image detection ✓ ✓ ✓ ✓ Headway, vehicle length, density, and queue length Cell phone signaling data × × ✓ × Travel path GPS data ✓ × ✓ × Travel path ETC gantry data ✓ ✓ ✓ ✓ Headway, vehicle length, density, total mileage, and trip od Note. ✓: detectable; ×: not detectable. model is more suitable for the case of a narrow Te model-driven trafc state method has solid ex- calculation range of parameter data, and small data planatory power, makes idealized assumptions, and has granularity of trafc states subcategory research. limited application scenarios for any given highway seg- ment. Tis approach may ignore certain factors, such as (iii) Te multiscale training idea and the self-training weather type, that may signifcantly afect trafc charac- idea are introduced to the learning process of the teristics when considering the impact of site roadway ge- STM. Te multiscale training idea improves the ometry and trafc features on the baseline conditions [14]. classifcation training content, and the self-training Model-driven trafc state identifcation methods based on idea realizes the expansion of the STM labeled data the model are unable to segment trafc states [15]. set to overcome the problem of the small size of the initial labeled data set. 2.2. Identifcation Methods Driven by Big Data. Seo et al. [16] Section 2 gives a general overview of the work related to and Li et al. [17] reviewed the research theories and methods highway trafc state identifcation. Section 3 defnes the of highway trafc state identifcation and concluded that tensor basis and the method of constructing three-order future research hotspots are machine learning and deep tensor models for ETC gantry data. Section 4 introduces the learning. Tsubota et al. [18] compared the trafc state ISTM method. In Section 5, the results and discussion of the identifcation methods based on two kinds of data, trafc experiments are provided. We discuss the conclusions and volume and GPS vehicle trajectory. Tang et al. [19] used the future work in Section 6. K-mean method to classify the input samples into clusters. Xu et al. [20] proposed a trafc state identifcation algorithm 2. Related Work for road trafc networks based on compressed sensing. With the maturity of fuzzy control technology, scholars in- Trafc state identifcation refers to characterize and judge troduced fuzzy theory and integration methods into the the highway trafc state by constructing a specifc identi- trafc feld. Stutz and Runkler [21] applied fuzzy clustering fcation model through current and historical trafc fow to study trafc state classifcation methods to improve the data and their mapping relationship. It is the crucial premise classical clustering model. Wang et al. [22] proposed an for evaluating road efciency, discovering trafc congestion improved trafc state identifcation model based on selective bottlenecks, and formulating trafc control plans. In recent integration learning (SEL). years, domestic and foreign scholars have conducted ex- In addition, trafc state identifcation algorithms also tensive research on trafc state identifcation methods and contain classifcation models such as the SVM [23, 24], K- proposed various identifcation methods, which have laid nearest neighbor (KNN) [25], decision trees [26], and neural a theoretical foundation. Te trafc state identifcation networks [27], whose input features are primarily in vector methods could be divided into those based on trafc fow form [28]. However, the dimensionality of the trafc net- theory, and model-driven identifcation methods and data- work feature set is large. Te dimensionality reduction is driven artifcial intelligence identifcation methods. processed as feature vector input to the classifcation model. Te structural information between diferent dimensions of the trafc fow network is easily lost, and the connection 2.1. Identifcation Methods Driven by Trafc Flow Teory Model. Te trafc fow theory-based and model-driven between road segments on the road needs to be addressed identifcation methods include the following: Yuan et al. [29]. Considering a robust spatialtemporal correlation [11] proposed a new trafc state estimator based on the model when modeling trafc fow data is crucial. extended Kalman flter (EKF) technique using an LWR Tensor modeling is a standard technique for capturing model as the process equation. Hara et al. [12] improved the multidimensional structure dependencies. Te tensor model Gaussian graphical model using the EM algorithm and the uses a compact structure to simulate the original multidi- graph lasso technique to determine the model parameters. mensional data. In addition, tensor models efectively solve Herrera and Bayen [13] integrated GPS data into the trafc the problem of the high dimensionality of feature data, and tensor-based classifcation methods have been widely used state estimation model and compared it with the application of the Kalman flter. in artifcial intelligence and other felds [30]. Large data sizes Journal of Advanced Transportation 5 characterize ETC gantry data, multiple data types, and high 3. Tensor Basis and Data Preparation dimensionality. Te previous research shows that tensors Tensor is a multiple linear mapping defned on the Cartesian could maintain the inherent structural characteristics of the product of vector space. We organize the spatialtemporal data to the maximum extent [31]. trafc data into a multidimensional structure by combining In conclusion, a tensor is a convenient tool in trans- trafc information (trafc volume and average travel speed), portation research, especially when modeling complex time series information with other locations information, transportation datasets with spatial-temporal multidimen- and the spatialtemporal trafc data is structured as a mul- sionality and efectively handling high-dimensional data tidimensional array (location × time × trafc information), information. It is necessary to further improve the accuracy i.e., a tensor structure. Te tensor array of N order is denoted of trafc state identifcation by mining complex trafc fow I ×...×I 1 N as X ∈ R , where the Kth order contains k compo- dynamics from the massive trafc big data through in- nents within elements denoted as x , where i , i , ..., i telligent techniques. Terefore, this paper proposes to use i ,i ,...,i 1 2 k 1 2 k represents the component expression under the the STM to efectively preserve the correlation structure respective order. features among trafc states of diferent road sections and Tere are many types and complex structures of data reduce the loss of feature information. tables within the ETC gantry system. Te data are updated quickly, and the value density of each type of data is uneven. 2.3. Research on Evaluation Indicators. Te results of trafc Tis paper mainly uses the property that tensor models state identifcation and the selection of identifcation in- could satisfy the transformation of tensor components in dicators have a very close correlation and are directly diferent spatial systems and use the relationship between infuenced by the selection. Te current trafc state study tensor computation and linear algebra for multidimensional combines multiple trafc state indicators to generate more trafc data computation [40]. In this section, an ETC gantry accurate and comprehensive trafc state identifcation in- data three-order tensor model is proposed to analyze the dicators. El-Hamdani and Benamar [32] integrated the trafc data contained in the ETC gantry data table based on design of diferent evaluation indicators for communication the tensor model, and the extracted various data used for type, vehicle priority, system behavior, road model, and trafc data calculation. parking avoidance. Sun et al. [33] used the average speed of Te ETC gantry license plate data table and ETC gantry roads to assess trafc conditions in urban business districts. transaction data table in the ETC gantry system are selected Zhao and Hu [34] used the extra travel time to analyze the to set the ETC gantry subtensor block model. Te subtensor spatial and temporal patterns of trafc congestion in Beijing blocks are extended to construct the tensor block model. Te over six months. Chen et al. [35] proposed a speed per- ETC gantry tensor block model is sequentially arranged into formance index to assess trafc conditions and congestion the ETC gantry three-order tensor model along the time patterns on urban freeways by considering trafc fow speeds series. We use the ETC gantry three-order tensor model to and road speed limits. Deng et al. [36] developed a new extract the required trafc data. method for trafc state evaluation based on a cloud model Te structure of ETC gantry data based on the tensor that integrates the advantages of structured trafc state model is shown in Figure 1. indicators. Evaluation based on performance and indicators is the mainstream research in evaluating trafc 3.1. ETC Gantry Subtensor Block Construction. ETC gantry conditions [37]. I ×I ×...×I x y z subtensor block model X ∈ R performs data di- To enhance the objectivity of trafc indicators, we should mensionality reduction on the ETC gantry data table and show the most comprehensive and critical information with transcodes the selected data attributes. First, the ETC gantry the help of the least number of indicators. Adequate cre- number, capture time, and identifcation license plate in the dentials for trafc management control, trafc volume, ETC gantry license plate data table are selected to build the average travel speed, and space occupancy could compre- ETC gantry license plate data subtensor block. We select hensively refect the road trafc operation [38]. Te average time, space, and vehicle user as the subtensor block di- travel speed is the most direct refection of the trafc state mension division scale and categorize the above three felds [39]. In this paper, the above three are selected as trafc state into each subtensor block dimension. identifcation indicators, calculated and characterized using a three-order tensor model. I ×I ×I time space car X ∈ R , I � t , I � s , I � c , (1) LPR time pictime space id car license where t is the vehicle capture time, s is the ETC gantry Similarly, the three felds of transaction time, gantry pictime id information, and c is the captured vehicle ID number, identifcation license plate, model information, and license information. transaction situation in the ETC gantry transaction data 6 Journal of Advanced Transportation I ×I ×I time space car X ∈ R , I � t , I � s ETC gantry Database trade time trade time space id (2) I � c , c , c , car license type match ETC gantry license ETC gantry where t is the vehicle capture time, s is the ETC gantry pictime id plate recognition Transaction datasheet information, and c is the captured vehicle ID in- datasheet license formation, vehicle type information, and transaction ETC gantry ETC gantry situation. sub-tensor block sub-tensor block construction construction 3.2. ETC Gantry Tensor Block Construction. We obtain the ETC gantry tensor block by extending each subtensor SPACE TIME CAR block model. ETC gantry tensor ETC gantry tensor ETC gantry tensor block construction block construction block construction Defnition 1. Subtensor block merging: the subtensor block ETC gantry high-order tensor models are projected on the high-order tensor dimension, model and the same-order components of the subtensor block models in the same dimension are merged. In contrast, the Trafc data calculation based on components of diferent orders are retained. Te subtensor a high-order tensor model blocks are merged in the following way: Figure 1: ETC gantry data structures. table are selected to construct the ETC gantry transaction data subtensor block as follows: I×J×K A ∈ R , I � i , i , i , J � j , j , j , K � k , k , 1 2 3 1 2 3 1 2 I×J (3) B ∈ R , I � i , i , i , J � j , j , 1 2 4 1 2 I×J×K f: A× B ⟶ C, C ∈ R , I � i , i , i , i , J � j , j , j , K � k , k . 1 2 3 4 1 2 3 1 2 �������������������������������� � i �I ,...,i �I 1 1 N N Te ETC gantry tensor block represents the trafc sta- ‖A‖ � A i , . . . , i × A i , . . . , i . (4) F 1 N 1 N tistics of an ETC gantry within 1 minute and is sorted in i �1,...,i �1 1 N a specifc sequence to simplify the calculation. Te tensor block selects a time, space, and vehicle users as the di- mensional spaces. Among them, the time dimension com- Defnition 3. Tensor inner product: the inner product be- ponent parameter is set to 1 minute, containing 60-time I ×...×I I ×...×I 1 N 1 N tween the tensorA ∈ R ,B ∈ R , defned as the component parameters. Te space dimension contains the sum of the products of its corresponding components. following two spatial factors: gantry number and lane number. Te vehicle user’s dimension contains the following i �I ,...,i �I 1 1 N N three parameters: license plate information, vehicle type 〈A,B〉 � A i , . . . , i × B i , . . . , i . (5) 1 N 1 N information, and vehicle transaction information. i �1,...,i �1 1 N 3.3. Trafc Data Calculation Based on the High-Order Tensor Model. Figure 2(a) shows the three-order tensor model Defnition 4. Tensor K mode product: the mode product of I ×...×I I ,I 1 N k k extracted from the 30-minute trafc statistics of two ETC tensor A ∈ R and matrix B ∈ R , tensor A and gantries (three lanes) with diferent colour tensor blocks matrix B is denoted as A× B, and the result is tensor I ×...I ×...×I 1 K N indicating diferent types of cars. Te basic tensor algebra C ∈ R , which is calculated by the following operations used in this paper are described as follows: formula: Defnition 2. Frobenius parametrization: the Frobenius A× B ≜ A I , . . . , I , p, I , . . . , I × B(p, q). (6) I ×...×I K 1 K−1 K+1 N 1 N parametrization ‖ · ‖ of a tensor A ∈ R is calculated p�1 as follows: G005032001000110020 G005032001000110020 G005032001000110020 G005032001000110010 G005032001000110010 G005032001000110010 PICTIME PICTIME PICTIME Journal of Advanced Transportation 7 CARTYPE CAR (a) CARTYPE CAR (b) CARTYPE CAR (c) Figure 2: Diagram of trafc data calculation based on the ETC gantry high-order tensor model: (a) diagram of the ETC gantry high-order tensor model; (b) diagram of high-order ETC gantry tensor models; (c) diagram of vehicle-level retrieval information. GANTRYID GANTRYID GANTRYID 9:00 9:01 9:02 9:03 9:04 9:05 9:06 9:07 9:08 9:00 9:09 9:01 9:02 9:10 9:03 9:11 9:04 9:12 9:05 9:13 9:06 9:14 9:07 9:15 9:08 9:16 9:00 9:09 9:01 9:17 9:10 9:02 9:18 9:11 9:03 9:19 9:12 9:04 9:13 9:20 9:05 9:06 9:14 9:21 9:07 9:15 9:22 9:08 9:16 9:23 9:09 9:17 9:24 9:10 9:18 9:25 9:11 9:19 9:12 9:26 9:20 9:13 9:27 9:21 9:14 9:28 9:22 9:15 9:29 9:23 9:16 9:30 9:17 9:24 9:18 9:25 9:19 9:26 9:20 9:27 9:21 9:28 9:22 9:29 9:23 9:30 9:24 9:25 9:26 9:27 9:28 9:29 9:30 8 Journal of Advanced Transportation We are taking an ETC gantry trafc data calculation as achieves the classifcation by solving the multidimensional an example. All trafc information of this ETC gantry could hyperplane simultaneously, as shown in Figure 3. be obtained by making a tensor slice of this gantry, as shown Te classifers are obtained by supervised learning of in Figure 2(b), including trafc volume, average headway tensor sample points to complete the classifcation of input time, the proportion of vehicle types, average interval speed, tensor points and the mapping from tensor dimensions to and space occupancy. Te vehicle-level retrieval information actual dimensions, which could be expressed as follows: is shown in Figure 2(c). Certain vehicle information can be y � f〈X;W, b〉 � 〈X,W〉 + b, (7) obtained by inputting vehicle license plate information, the tensor block model in which the vehicle is located can be where y ∈ {+1, −1} is the input sample binary category label, I ×...×I indexed, and trafc parameters calculated, including average 1 N X ∈ R is the input N order tensor training sample, travel speed and headway time. I ×...×I 1 N W ∈ R is the weight tensor of the classifcation hy- perplane, X,W are of the same scale, and 〈X,W〉 is the 4. Methodology inner product of the two tensors. Terefore, the STM classifcation problem could be transformed into solving the 4.1. Support Tensor Machine. To distinguish the vector ∗ ∗ optimal X ,W to maximize the category interval. More- sample x , x ∈ R , i � 1, . . . , l, the SVM designs the hy- i i over, the constraint expression could be set as an afne perplane w x + b � 0 to classify it with high dimensionality, function constraint and expressed as a convex optimization where w ∈ R is the hyperplane normal vector and b ∈ R is problem according to the tensor space constraint. Te su- the hyperplane intercept [41]. Te classifcation hyperplane pervised learning-based STM optimization model is distinguishes two types of data at a specifc interval, and this expressed as follows: interval distance refects the diference between the two types of data. Te larger the interval between the two types of data ‖W‖ min f(W, ε) � + C ε represents, the more signifcant the diference between the i W,ε i�1 two types of data. Terefore, the problem of fnding the best (8) decision hyperplane could be transformed into solving the s.t.y 〈W,X 〉 + b ≥ 1 − ε i i i maximum interval between two types of data to ensure that the hyperplane is robust. Te upper and lower boundaries of ε ≥ 0, i � 1, 2, . . . , l. the delineated interval must pass through some sample points closest to the decision hyperplane, and these sample Te ε represents the error tolerance of the decision vectors that determine the size of the interval distance are hyperplane for the sample points. Te correct classifcation called support vectors [42]. Te classifcation could be points are ε � 0, the points in the classifcation interval are performed based on the point’s relative position to the ε ∈ (0, 1), and incorrect classifcation points are decision hyperplane. Te SVM classifcation problem could ε ∈ (0, +∞). Te STM is updated to seek the optimal so- ∗ ∗ be transformed into solving for the best w , b that max- lution of the objective function in maximizing the classif- imizes the category interval L. cation interval and minimizing the error. It is solved using Te core idea of the STM is the same as that of the SVM, the Lagrange multiplier method, where the above inequality and the STM chooses a tensor form as the training sample constraint is solved by introducing non-negative Lagrange points, fnds the tensor hyperplane in the tensor dimension, factors λ and β . i i and has a higher generalization of the data features. Te STM 2 l l l ‖W‖ (9) L W, b, λ , β � + C ε − λ y 〈W,X 〉 + b − 1 + ε − β ε . i i i i i i i i i i�1 i�1 i�1 To improve the efciency of the solution, the model is W � λ y X , i i i i�1 transformed into dual problem-solving. Te sufcient condition for the original problem and the dual problem to l (10) ∗ ∗ λ y � 0, have optimal solutions simultaneously that X , λ , β meet i i i�1 the KKT condition, and the partial derivatives are found for W, b, ε and set to zero. λ + β � C, 1 ≤ i ≤ l. i i TIME Journal of Advanced Transportation 9 ETC ETC Gantry Data High- Input traffic data ISTM Multiclassification Order Tensor Model Decision Figure 3: Diagram of the STM classifcation. Substituting equations (9) and (10) to obtain an unknown distribution of trafc data and insufcient a priori equivalent expression for the STM model optimization knowledge, using the distance between two tensor samples problem, to refect the current structural information. It lets K〈X ,X 〉 satisfy Mercer’s theorem and uses the Gaussian i j l l l kernel function to make the dimensional transformation, ⎛ ⎝ ⎞ ⎠ max λ − λ λ y y 〈X ,X 〉 i i j i j i j λ 2 and the model is as follows: i�1 i�1 j�1 � � � � � � � � X − X � � i j l F (11) ⎝ ⎠ ⎛ ⎞ (14) K〈X ,X 〉 �〈ψ