Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Intelligent IoT systems for traffic management: A practical application

Intelligent IoT systems for traffic management: A practical application INTRODUCTIONIntelligent Transportation Systems (ITS) are witnessing a paradigm shift in their modus operandi. Intelligence, understood as the capability of a system to assist, manage, and make decisions to improve key performance metrics, is entering a new reality. One of the reasons lies in prompt advances in traffic data acquisition [1]. This increasing amount of available data allows better and more responsible use of resources. And the origin of these data is vast. Still, in general, we can affirm that either from sensor devices or from Road Side Units (RSU), in‐vehicle communication, Vehicle‐to‐Vehicle (V2V), Vehicle‐to‐Infrastructure (V2I), Vehicle‐to‐Everything (V2X), etc. is being transformed thanks to IoT devices. These IoT devices can be embedded in practically any system or device, offering the ability to communicate, provide information, or automate tasks. Besides, these IoT devices have low energy consumption, which allows them to be integrated into battery‐powered devices, thus extending their useful life.Traffic management, as a part of traffic control, represents a relevant segment of the studies carried out in ITS. The Panel on Future Directions in Control, Dynamics, and Systems identified in [2] several challenges for control systems with a straightforward application in ITS: the notion of operating in a distributed system and the need for coordination and autonomy. Currently, ITS are distributed by nature; hence an effort is needed to visualize the system as such [3]. A distributed system is usually defined as a set of independent computing elements whose hardware and software components communicate and coordinate their actions only through passing messages, appearing to the final user as a whole system (transparency). Among others, design challenges in distributed systems include performance, robustness, and reliability, with emphasis on achieving these features in the context of communication within ITS. Regarding coordination and autonomy, Murray et al. explain in [2] that the study and development of robust control systems require more elaboration in order to realize higher level decision‐making systems. The benefits of such decisions are enormous: to enhance the efficiency of the vehicle transportation system by improving safety, reducing pollution (particle and noise), decreasing fuel consumption, and increasing service quality (e.g. with shorter vehicles’ travel and waiting times). For instance, according to the National Highway Traffic Safety Administration data [4], incorrect driver decisions account for 33% of accidents, being driver recognition the top one reason (41%). These figures are expected to be significantly reduced with proper processing of collected data, and an adequate system‐intelligence applied.For intelligent and autonomous traffic light control at intersections, researchers have explored a wide range of approaches. The general methods that have been used to create smart, autonomous control systems for signaled intersections have been diverse, such as fuzzy logic [5–7], reservation and market‐based system [8–10], neural networks [11–13], reinforcement learning [14–18], and swarm intelligence and evolutionary computation [19–21].The main problem with most of these algorithms is that they are complex to run in real‐time on IoT devices, demanding the resources of cloud computing [22]. But with the rise of Smart Cities, autonomous cars, 5G, and ITS, using the advantages offered by IoT (integration, embedded, speed, simplicity, etc.) can be a definite advantage for the development of intelligent traffic light control systems. In this paper, we face the problem of combining IoT with intelligent algorithms. Random Early Detection for Vehicles Dynamic (REDVD) is a very simple and efficient mechanism that optimizes the phases and cycles of traffic lights in signaled intersections [23]. However, REDVD outperforms other mechanisms in a stand‐alone intersection, it does not behave as expected in more complex scenarios. The reason lies in the number of configuration parameters that influence its performance. Therefore, we propose to use an evolutionary algorithm (EA) to obtain the best parameter configuration of REDVD. Then, the performance of this new enhanced version, called improved REDVD (iREDVD), is tested under complex and previously‐unknown traffic scenarios. The Key Performance Indicators (KPI) used in this study are: waiting time, trip time, travel speed, emissions (specifically, CO, CO2, HC, PMx, and NOx), and fuel consumption. In the light of our outcomes, iREDVD is a light‐by‐design method that can be deployed in IoT devices and outperforms not only REDVD but also other well‐known traffic management methods in all the KPI under study.The rest of the paper is organised as follows. In Section 2, we briefly summarise the state of the art. Section 3 describes our proposal. In Section 4, the optimization process is detailed. The methodology followed for the performance evaluation is explained in Section 5. In Section 6, we show and discuss the obtained results. The paper ends summarising the most important outcomes.RELATED WORKSWithin evolutionary computation, there are two main branches. First, we have those that use genetic algorithms (GAs) to represent the phases and timing of traffic lights as a set of chromosomes and perform optimisation directly on these chromosomes (could also be included within the fuzzy logic). Second, we have those that use GA as an optimisation algorithm to optimise a traffic light control system with a large number of parameters, which is unfeasible to optimise otherwise. The main advantage of these algorithms is that they are light to run but require a large computational load for training or for adjusting their parameters. However, this disadvantage can be mitigated by working in the cloud during a training phase and updating the algorithm easily once finished.Within the first group, it stands out the work carried out by Sánchez et al. [24], where GA is used to encode a fuzzy logic controller in the chromosomes and find the optimal parameters according to the number of cars waiting. Also, similar works on optimising traffic light timings using GA are presented in [21], [25]. An approach linking GAs with device communication (D2D) can be found in [26]. This D2D approach is used to collect information from sensors and actuators, as well as to try to reduce the response time.Concerning the second group, it is within this one that our work is encompassed, since we use a GA to optimise the traffic light control algorithm. The advantage of this method is that it allows us to shift the complexity of the search for the parameters of the control algorithm to a training scenario. This allows us to adapt the control algorithm to numerous scenarios and changes in the environment and act in advance to planned changes to be able to study the scenario beforehand. Other works use optimisation algorithms similar to GAs, such as ant colony optimization (ACO) algorithms. An ACO algorithm is a probabilistic technique for solving computational problems that can be reduced to find good paths through graphs. Within this category, the work done by Abdul Rehman et al. [27] and Kponyo Jerry et al. [28] stand out, where both solve the traffic congestion problem applying ACO.Differing from previous works in the existing literature, our work focuses on optimising an adaptive traffic light control algorithm known as REDVD by means of a GA. Our improved version of this algorithm is called iREDVD. The novelty lies in the use of GAs for the optimisation process, facilitating the task of setting the best values for the numerous configuration parameters that the algorithm has. To the best of our knowledge, whereas most works from the related literature include GAs in traffic management procedures themselves, we only use it as an offline step. Then, we demonstrate that once the parameters are set, iREDVD works successfully under unknown traffic scenarios keeping its lightness feature and without the burden of AI. Therefore, it is possible to use it in IoT devices.RANDOM EARLY DETECTION FOR VEHICLESAs seen in the previous section, there are several approaches to traffic light controlled intersection control algorithms. In this section, we will show the basis of the ITS proposed in [23], namely REDVD, which serves as the basis for this work.The operating principle of REDVD is the Random Early Detection (RED) [29] algorithm, used for congestion control in communications networks. RED is a congestion control technique extensively used in packet‐switching communication networks. It is used to detect and avoid congestion and delay at an early stage and to improve the throughput rate. RED avoids congestion by randomly discarding packets, depending on the number of packets waiting to be processed in the router. These packets can be dropped if the packet queue to be processed exceeds a certain threshold; the dropping probability increases as the queue size increases. By doing so, congestion can be avoided even before it occurs. For more information about RED, please refer to [29].Note that the concept of a queue at a router and a queue at a traffic light in a signalled intersection are analogous. As proposed in [23], the analogy with intersections occurs when, instead of discarding packets, we increase the green‐time when many vehicles are queuing to cross an intersection. With this, we can get more vehicles crossing the intersection. The first adaptation of RED for traffic control in urban scenarios was introduced in [23] and was called RED for Vehicles (REDV). REDV adapts the RED algorithm to the particularities of vehicles and signalled intersections. The idea was as follows. In REDV, the number of vehicles waiting to cross at each intersection branch is similar to the number of packets queued for processing in a router. Instead of dropping a packet with a certain probability that depends on the number of packets to be processed, what REDV does is to increase or decrease the green time of each intersection branch with a probability that depends on the number of cars waiting to cross. Thus, when there are many cars on a branch, the green time for that branch increases frequently. Otherwise, when there are too few cars waiting to cross, the green time is likely reduced. In any case, the total cycle of the intersection was a fixed value. REDV demonstrated significant performance as an adaptive control system, managing incipient congestion at isolated intersections proactively.To deal with changes in the traffic flow rate i.e. in the number of vehicles crossing the intersection per unit of time, it was necessary to adapt the total cycle. The idea was to provide long cycles to accommodate a large volume of traffic and short cycles when the volume of traffic was small. Consequently, the same authors introduced in [23] a new version called REDV Dynamic (REDVD). Although the performance in terms of traffic metrics was improved, this adaptation added new adjustable parameters to the configuration, parameters that need to be appropriately tuned for the algorithm's optimal performance. Table 1 includes all the configuration parameters of REDVD.1TABLEREDVD parametersParameterDescriptionminthThe minimum threshold of vehicles to increase the green‐time phases probabilistically.maxthThe maximum threshold of vehicles to increase the green‐time phases probabilistically.deltaThe increase/decrease of the green‐time phase, either because it is above/below the maxth/minth thresholds, respectively or because of a probabilistic increase when it is between these thresholds.min_greentimeThe minimum time for a green‐time phase.max_greentimeThe maximum time for a green‐time phase.limincNumber of consecutive times the green‐time phase must be increased, taking into account both arteries to increase the cycle for all phases.limdecNumber of consecutive times the green‐time phase must be decreased, taking into account both arteries to decrease the cycle for all phases.delta_cycleThe increase/decrease of the cycle for all phases.min_cycleThe minimum cycle for all phases.max_cycleThe maximum cycle for all phases.wqFactor that determines the weight of the historical value versus the current value in the queue length.maxpFactor used in RED algorithm.These new parameters contribute to higher flexibility in traffic control, being adaptive to continuously changing traffic conditions. However, the process of optimising this large set of parameters is not trivial. Indeed, the values used in [23] for configuring REDVD were obtained empirically. For this reason, one of the goals of this work is to find out the optimal configuration able to offer the best performance in terms of the average waiting time per intersection for each vehicle. This optimisation will be carried out using an EA, more specifically, employing a GA, which is the most popular type of EA. This version optimised by means of a GA is called iREDVD. The entire optimisation process is described in the following section.OPTIMISATION PROCESSA GA consists of a series of mechanisms inspired by the theory of evolution to optimise a set of parameters within a problem. It allows us to optimise that problem, a priori complicated, using a set of light procedures with a low computation load, which derives in a fast parameter optimisation [30], [31].The general procedure consists of four phases: initialize population, fitness calculation, selection, and crossover, which form a generation. The overall optimisation process will take as many generations as needed. A population is a set of individuals. The number of genes that each individual contains is determined by the number of parameters to be optimized. In our case, the parameters are included in Table 1. And each gene contains the coded information of each parameter; e.g. in our case, the maxp parameter will be a float number between 0 and 1. As an example, in a problem with six binary parameters to optimise (such as the knapsack problem [32]), an example of population would be the one shown in Figure 1. In this figure, it can be seen a population of four individuals (I1 to I4), each one with six parameters to optimise (P1 to P6), and each of these parameters is coded in binary (0 or 1) in each gene.1FIGUREExample of a population of a GA, which consists of four individuals and six binary parameters to be optimisedFor each generation, the best‐adapted individuals to the environment are selected so that the next generation individuals contain their genes. The meaning of “better adapted to the environment” is given by the fitness function, which numerically measures how well an individual can adjust to the problem being optimised. For example, in tasks of humanoid robots learning to walk where genes encode the movement of joints and muscles, the fitness function will be the distance travelled, and those individuals who have gone the furthest will be most likely to be selected. Next, we describe the four phases of the general optimisation procedure in detail (see Figure 2).2FIGUREA general GA procedure. The convergence condition can be different, either several entire generations or a fitness value improvement after a few generationsInitialize populationThe procedure starts by initialising the population of individuals with random genes (although other approaches can be followed in order to reduce the convergence time [33], [34]). The population's size is a crucial factor for the correct convergence of the GA because it causes more diversity in the population and ensures that a large amount of the parameter space is being explored.Nevertheless, there is an important trade‐off between the population's size, genetic diversity, and convergence speed. If the population size is considerable, then we have great genetic diversity by exploring a wide range of the parameter space at random. Still, it will take a long time to get the fitness of all the individuals in the population. The size of the population depends strongly on the problem studied as to cover a considerable range of the parameters, it is necessary to increase the population's size exponentially. A basic (minimum) recommendation is to have twice more individuals than the parameters studied. Typical population sizes can be between 20 and 100 individuals [30], [35], [36].Fitness calculationEach individual in the population is evaluated, and its level of fitness is obtained with its parameter settings. For instance, if we are designing a robot that is able to walk, the fitness of each individual would be how far it can walk [37], or in the knapsack problem [32] the adjustment would be the benefit obtained by the objects inside the knapsack that minimize the weight. In our case, it will be the average waiting time per intersection for each vehicle.SelectionOnce all the individuals in the population have been evaluated, the next step is to select the subset of individuals that are best‐adapted to the environment. There are various mechanisms for selecting the best individuals [38–40] e.g. roulette wheel selection, Boltzmann selection, tournament selection, rank selection, steady‐state selection, or Elitism selection, among others. One of the simplest and most effective methods is to select the percentage of individuals who have the highest level of fitness. With this mechanism, the subset of best‐adapted individuals to the environment will be obtained in the current generation. These selected individuals are known as parents and will be the ones from whom the children are generated.The percentage of individuals selected is crucial for the correct functioning of the GA. A very high value will prevent rapid convergence, but a very low value will nevertheless lead to low genetic diversity and a high possibility of converging towards a local minimum. Typical values are between 5% and 10%, depending on the population size [30], [35], [36]. The remaining individuals (children) will be generated from these parents to obtain the total population employing crossover and mutation mechanisms.Although there are very sophisticated methods for the selection mechanism, it is recommended [41] to use rank selection in the first generations to quickly obtain preliminary results. More sophisticated techniques can be found in [40].CrossoverCrossover is a mechanism by which new individuals (children) are generated from two or more parents’ direct mating. This mechanism is done to obtain children with the genes of those parents with a good fitness value. The basic operation of this mechanism consists of randomly selecting genes from different parents to generate new children. There are various crossover mechanisms, such as the selection of a single crossover point, the selection of multiple crossover points, or the gene‐by‐gene combination [42–45]. The mechanism used in this work is the single crossover point due to its simplicity and good results [44]. In this mechanism, a crossing point is selected. Then, from the beginning of the genes to the crossing point is transferred from one parent and the rest is transferred from the second parent.The crossover rate indicates how often parents are crossed over to get new children, so it controls what percentage of the children are exact copies of the parents and which ones are crossed over. That is, if the crossover rate is 100%, all the children are obtained by crossing the parents. However, if the crossover rate is 0%, all the children are exact copies of the parents. This 0% does not mean that the new generation will be identical to the parents since, after the crossover, the genes of these created children can be mutated. The crossover rate should generally be high, between 80% and 95% [30], [35], [36].MutationIn order to obtain a high genetic diversity, mutation mechanisms can be applied to a small percentage of genes in the population. This mutation means that the value of each gene can vary randomly with a small probability. The mutation prevents the GA from converging to a local optimum, as it allows other combinations of genes that can achieve better fitness levels to be explored at random [46], [47]. However, this mutation should not occur very often, since then it would become a random search.The mutation rate indicates how often an individual's gene will be mutated. If the mutation rate is 0%, the population created after the crossover does not mutate. However, if the mutation rate is 100%, all the offspring's genes are changed randomly, within a range of change, which depends on the scenario. This margin of change should be less than 20% of the allowed range for each gene [47]. In turn, the mutation rate should remain around 0.1% and 1%, to enable the exploration of a wider space of parameters, obtaining genetic diversity, but without falling into exploring the entire parameter space at random [48], [49]. In addition, it is advisable to decrease the mutation rate, as well as the margin of change of the parameters, as the generations go by, in order to obtain a rapid convergence [30], [35], [36]. Figure 2 shows the general procedure followed by the GA until the convergence condition is satisfied.For iREDVD, we choose a population size of 256 individuals, 16 selected parents, a crossover rate of 80%, a mutation rate of 0.1%, and up to 20 generations.MATERIALS, SCENARIOS, AND METHODSThe algorithms under study have been evaluated via computer simulations. All methods were programmed in Python v3.7. To compare the different proposals, these scripts were coupled to the microscopic traffic simulator SUMO [50] v1.0.1 with TraCI (Traffic Control Interface). SUMO is a valuable tool as it allows flexible simulations in customizable scenarios and editable traffic patterns. The CPU used was an Intel Xeon 16 cores @ 2.6GHz. Three scenarios were used in this work, one for training and two for testing. The details of each scenario can be seen in Table 2. These three scenarios share some characteristics and differ in others, which are described in the next subsections.2TABLECharacteristics of the training and the testing scenariosScenarioNumber of intersectionsIntersection layoutDistance between intersectionsSimulation durationTrain164×4300m10hTest151×5200m12hTest210010×10250m12hOn the one hand, the training scenario consists of a grid of 4×4 intersections (in total 16 intersections) with 2 lanes for each direction, and a distance between intersections of 300m (see Figure 3). The vehicle flow rate fluctuated over time, remaining constant for one hour and following the pattern shown in Figure 4 afterward. The low flow rate was 600 vehicles/hour/branch, the medium flow rate 1200 vehicles/hour/branch, and the high flow rate 1600 vehicles/hour/branch. The north (N) and south (S) branches had identical flow rates, and the same applies to the east (E) and west (W) branches. As shown in Figure 4, there are times when the flow rate is symmetric (e.g. hours 0, 1, and 2) and others when is asymmetric (e.g. hours 3, 4, and 5).3FIGURESimulated topology for the training scenario: Manhattan 4×4 network with 300 m between each intersection4FIGUREVehicle flow rate per branch used in the training scenarioOn the other hand, the testing scenarios are used to evaluate the performance in new conditions i.e. situations the algorithm has not been trained for. Simulations run in the testing scenarios use the optimal parameters obtained previously in the training. The test scenario 1 consists of a large avenue of 5 intersections (1×5) separated by 200 m between each intersection, as shown in Figure 5(a). The test scenario 2 consists on a 10×10 Manhattan grid, with a total of 100 intersections, as can be seen in Figure 5(b).5FIGURESimulated topology for the testing scenarios: (a) 1×5 network with 200m between each intersection; (b) Manhattan 10×10 network with 250m between each intersectionBoth test scenarios have the same vehicle flow rate, depicted in Figure 6. The vehicle flow rate is more chaotic than in the training scenario, hence, stressing the different algorithms under study. As depicted in Figure 6, the constant vehicle flow rate stretches are maintained for 15 min, and the low, medium, and high flow rates are 700, 1000, and 1800 vehicles/hour/branch, respectively. With these test scenarios, we check that iREDVD algorithm is not overfitted to the training scenario.6FIGUREVehicle flow rate per branch used in the testing scenariosIn all scenarios, the selected vehicle distribution was based on the fleet of vehicles of the city of Madrid (Spain) [51]. This distribution can be seen in Table 3. Also, in order to obtain both fuel consumption and pollutant emissions from vehicles, the pollution model used was the HBEFA [52]. HBEFA is an emission factor database for road vehicles and provides emission factors (hot start, cold start, evaporation) for all regulated and important unregulated air pollutants as well as for fuel consumption.3TABLEType of vehiclesVehiclePercentageFuelCar30%GasolineCar40%DieselMotorcycle10%GasolineMoped10%GasolineVan5%DieselBus5%Average of all fuel typesSimilarly, each intersection had four traffic lights that control each branch of the intersection (North, South, East, West) in all scenarios. Each intersection had two lanes on each branch. An example of an intersection is depicted in Figure 7. For simplicity, only the straight forward and the right‐turn movements were allowed (see Figure 7). This constrain can be found in numerous articles [53–57]. Mainly, it allows a quickly/simple implementation of a new concept to study the advantages and disadvantages it can offer. When adding the left turn, only one new phase needs to be incorporated in the whole cycle, thus allowing the vehicles to turn left. Therefore, we consider that the selected configuration is appropriate to show the benefits of the optimization process using the GA and the robust performance of iREDVD. The phases of the traffic lights were coupled (North with South and East with West) and were opposite (North and South vs. East with West) between each couple. In other words, if the traffic lights in the North and South branches were green, it means that East and West branches were red. Figure 8 illustrates an example of a cycle. The yellow‐time interval of the traffic lights was set to 3 seconds and the all‐red time to 2 seconds, for safety and realism. The safety distance used between cars was 1.5 m. A Poisson distribution was used to generate the traffic flows.7FIGUREAn example of a four‐way signaled intersection, as used in this study8FIGUREExample of a cycle in a traffic light. N≡North; S≡South; W≡West; E≡EastIn order to compare the performance of iREDVD with other algorithms for traffic control, we also implement and evaluate the performance of a system with fixed cycle (FX) time and with pre‐set offsets between consecutive intersections, known as Green Wave (GW). This offset is obtained as the time that a vehicle needs to move between two intersections once the green phase starts (i.e. in a coordinated manner like a green wave). Three different versions were studied for the FX and GW algorithms, which also modify the total cycle time. The cycle times of 30, 45, and 60 seconds were studied. The nomenclature used for each algorithm is expressed as the algorithm used and the cycle time; so, for example, FX45 corresponds to the fixed algorithm with a cycle time of 45 seconds, equally divided between the branches. In all cases, 10 experiments were run for each algorithm under study in each scenario, obtaining the mean value and its standard deviation.Finally, the optimised metric was the normalised waiting time (the waiting time per intersection), although trip time, average speed, CO, CO2, HC, PMx and NOx emissions, and fuel consumption were also studied. The normalised waiting time eliminates the influence of the number of intersections crossed by vehicles in each flow. Therefore, we obtain for each vehicle the ratio between the total waiting time and the number of intersections crossed during the travel.PERFORMANCE EVALUATION RESULTSTraining scenarioFigure 9 illustrates the value of the normalised waiting time of the vehicles per intersection during the optimisation process carried out by the GA in the training phase. As it is shown, after about five generations, the GA achieves promising values for most of the simulated individuals. Although the simulation could have been stopped at that time, we left it more to find a robust set of parameters, which over time are able to demonstrate their superiority over the rest. The time required for each of the simulations in the optimisation process for the test scenario was approximately 1 min. This means that simulating the 256 individuals for 21 generations (the initial one plus the next 20 generations) on a server with 16 processing cores (i.e. 16 simulations were simulated in parallel) required approximately 48 hours. Each simulation was repeated 10 times, and the mean value and standard deviation of each variable under study were obtained. Once the best parameter configuration has been obtained, the execution of the traffic light control algorithm is almost instantaneous since it no longer requires any artificial intelligence algorithm since it is based on a light, fast, and quick method that can be rapidly executed by low power IoT devices. Further information can be found in [23].9FIGUREOptimization process. “Best fitness” is the individual with the lowest normalised waiting time and “Average fitness” is the average normalised waiting time of all population. X‐Axis represents the generationsThe values of the optimised parameters after training are shown in Table 4. The optimisation process shows very interesting values. The parameter limdec indicates how many times the green time of the traffic lights is reduced so that the total cycle of the intersection is reduced. The fact that limdec is 1, means that if a reduction in traffic is detected, even a slight one, it is worthwhile to shorten the total cycle time (and therefore the green time) quickly. However, the opposite is true for the increase in total cycle time. The parameter liminc controls the increase in the total cycle time of the traffic lights when traffic increases. With a high value of 5, it means that iREDVD should be very sure before increasing the total cycle time.4TABLEOptimised parameters and valuesParameterOptimal valueminth10maxth25min_greentime15max_greentime60min_cycle45max_cycle130delta10delta_cycle15maxp0.85wq0.7limin5limdec1The results obtained in the training scenario (see Table 5) corroborate that iREDVD has an improvement of more than 50% (reduction in almost 25 seconds) in the average waiting time of the vehicles per intersection when compared to the traditional algorithms (Fixed 30, Fixed 45, Fixed 60, GreenWave 30, GreenWave 45, and GreenWave 60).5TABLETrain resultsAlgorithmAvg. norm. waiting time (s)Avg. trip time (s)Avg. speed (m/s)Avg. CO emissions (mg)Avg. CO2 emissions (g)Avg. HC emissions (mg)Avg. PMx emissions (mg)Avg. NOx emissions (mg)Avg. Fuel consumption (ml)TraditionalFX3051,5 ± 0,11213,96 ± 0,237,25 ± 0,016642,68 ± 21,51783,90 ± 4,49131,83 ± 1,0862,75 ± 0,563988,72 ± 31,07315,59 ± 1,79FX4548,3 ± 0,04201,81 ± 0,097,61 ± 0.026184,13 ± 24,62739,80 ± 3,07122,93 ± 0,2258,39 ± 0,143749,9 ± 15,14297,69 ± 1,11FX6051,64 ± 0,06204,76 ± 0,107,53 ± 0.016463,22 ± 50,19739,23 ± 2,78124,46 ± 1,4258,87 ± 0,693754,17 ± 26,77297,47 ± 1,19GW3046,13 ± 0,08203,29 ± 0,117,65 ± 0.046016,83 ± 36,41751,81 ± 2,05122,43 ± 0,3858,54 ± 0,163788,61 ± 14,89302,59 ± 0,71GW4550,02 ± 0,04206,29 ± 0,077,64 ± 0.016427,96 ± 20.95748,24 ± 2,40125,86 ± 0,8959,63 ± 0,433796,72 ± 19,92301,2 ± 0,99GW6047,4 ± 0,07199,45 ± 0,077,82 ± 0.026178,43 ± 18,74725,66 ± 5,34120,27 ± 1,0656,98 ± 0,523665,26 ± 34,72291,95 ± 2,11REDV original62,07 ± 4,01234,22 ± 7,617,22 ± 0,097560,28 ± 363,68819,66 ± 15,15143,19 ± 5,0267,75 ± 2,214221,82 ± 96,234221,82 ± 96,23REDVD original29,58 ± 0,33184,06 ± 0,678,27 ± 0,034708,8 ± 27,36723,41 ± 1,67107,39 ± 1,0052,51 ± 0,513573,78 ± 16,833573,78 ± 16,83iREDVD21,37 ± 1,86167,2 ± 2,919,14 ± 0,164074,01 ± 135,67672,11 ± 10,1396,01 ± 2,3546,96 ± 1,153275,5 ± 59,963275,5 ± 59,96Improvement of iREDVD vs traditional.*Abs‐24,76‐32,251,32‐1942,82‐53,54‐24,27‐10,02‐389,76‐21,83%53,6716,1616,8732,287,3720,1817,5810,637,47Improvement of iREDVD vs REDV orig.Abs‐40,7‐67,021,92‐3486,27‐147,54‐47,19‐20,79‐946,32‐59,92%65,5728,6126,5946,1118,0132,9630,6822,4218,15Improvement of iREDVD vs REDVD orig.Abs‐8,21‐16,860,87‐634,79‐51,29‐11,39‐5,55‐298,28‐20,85Improvement %27,759,1610,5113,487,0910,6010,568,347,16*Improvement when compared to the best traditional algorithm.Note: mean ± std of 10 testCompared with the original REDV algorithm proposed in [23], our improvement is still visible. This reinforces the fact that adjusting the parameters is necessary for the proper functioning of the algorithm. Finally, compared to the original REDVD variant also presented in [23], there is an improvement of 27%, which means a reduction of more than 8 seconds in the average normalised waiting time. Regarding other key performance metrics, such as average travel time, average speed, average CO, CO2, HC, PMx and NOx emissions, and average fuel consumption, we can see that the iREDVD improves in all of these KPI in the range of 7%–45%; most notably an increase in speed of 16%–26%, a decrease in average CO emissions of 13%–46%, and a decrease in average fuel consumption of 7%–18%, compared to traditional algorithms.Testing scenariosAfter the training, we stressed the algorithm in the testing scenarios using the optimised values for the configuration parameters. It can be seen from the results shown in Tables 6 and 7 that improvements are very similar to those obtained in the training scenario. This corroborates that iREDVD is able to adjust to unknown conditions (scenarios that it was not trained for), showing its robustness to be applied in real deployments.6TABLETest 1 resultsAlgorithmAvg. norm. waiting time (s)Avg. trip time (s)Avg. speed (m/s)Avg. CO emissions (mg)Avg. CO2 emissions (g)Avg. HC emissions (mg)Avg. PMx emissions (mg)Avg. NOx emissions (mg)Avg. Fuel consumption (ml)TraditionalFX3021,22 ± 0,25169,9 ± 1,217,34 ± 0,025395,04 ± 65,59597,56 ± 4,11102,91 ± 0,7348,99 ± 0,323073,18 ± 22,29240,61 ± 1,6FX4519,2 ± 0,03150,47 ± 0,167,72 ± 0,014756,42 ± 23,49551,99 ± 1,5492,4 ± 0,2744,06 ± 0,152814,95 ± 5,68222,12 ± 0,58FX6021,74 ± 0,01153,05 ± 0,087,58 ± 0,025047,54 ± 20,13555,22 ± 3,0295,51 ± 0,845,26 ± 0,422844,12 ± 23,67223,56 ± 1,17GW3017,68 ± 0,39154,05 ± 1,797,81 ± 0,044586,92 ± 69,72558,36 ± 3,4991,28 ± 0,9143,83 ± 0,412829,06 ± 20,56224,73 ± 1,4GW4519,25 ± 0,03151,67 ± 0,157,83 ± 0,054803,28 ± 19,67554,43 ± 3,2792,62 ± 0,7444,21 ± 0,382825,08 ± 21,72223,09 ± 1,28GW6021,36 ± 0,02151,66 ± 0,127,75 ± 0,034919,4 ± 18,56555,01 ± 3,0993,9 ± 1,1944,67 ± 0,552839,66 ± 22,56223,32 ± 1,19REDV original25,38 ± 0,71152,55 ± 2,947,73 ± 0,054663,73 ± 120,77547,49 ± 5,1591,42 ± 1,9143,61 ± 0,862789,18 ± 34,95220,32 ± 2,14REDVD original17,66 ± 0,47172,52 ± 2,997,49 ± 0,084771,9 ± 84,21580,91 ± 7,7095,48 ± 1,8645,93 ± 0,912953,28 ± 51,28233,81 ± 3,08iREDVD11,6 ± 0,27132,6 ± 1,658,38 ± 0,063406,27 ± 66,39511,34 ± 3,0777,09 ± 1,5637,62 ± 0,72536,01 ± 23,01205,74 ± 1,36Improvement of iREDVD vs traditional.*Abs‐6,08‐17,870,55‐1180,65‐40,64‐14,19‐6,21‐278,94‐16,38%34,3811,877,0225,737,3615,5414,169,917,37Improvement of iREDVD vs REDV orig.Abs‐13,78‐19,950,65‐1257,46‐36,14‐14,33‐5,99‐253,17‐14,58%54,2913,078,4026,966,6015,6713,739,076,61Improvement of iREDVD vs REDVD orig.Abs‐6,06‐39,920,89‐1365,63‐69,57‐18,39‐8,31‐417,27‐28,07Improvement %34,3123,1311,8828,6111,9719,2618,0914,1312,0*Improvement compared to the best traditional algorithm.Note: mean ± std of 10 test7TABLETest 2 resultsAlgorithmAvg. norm. waiting time (s)Avg. trip time (s)Avg. speed (m/s)Avg. CO emissions (mg)Avg. CO2 emissions (g)Avg. HC emissions (mg)Avg. PMx emissions (mg)Avg. NOx emissions (mg)Avg. fuel consumption (ml)TraditionalFX30139,05 ± 0,04428,63 ± 0,436,49 ± 0,0114273,45 ± 55,481508,16 ± 10,14266,46 ± 1,25125,60 ± 0,337738,98 ± 7,48608,09 ± 0,87FX45131,08 ± 0,07400,10 ± 0,856,85 ± 0,0213064,96 ± 44,841415,19 ± 4,25246,19 ± 0,84116,05 ± 1,157274,22 ± 10,44569,66 ± 2,15FX60134,60 ± 0,21399.23 ± 0,376,90 ± 0,0513388,89 ± 14,351396,32 ± 7,19246,41 ± 3,44115,58 ± 0,887181,57 ± 8,21561,98 ± 1,49GW30122,31 ± 0,18403,24 ± 0,596,69 ± 0,0512634,05 ± 43,011443,93 ± 13,25245,45 ± 2,25116,67 ± 0,917738,98 ± 4,97581,48 ± 0,45GW45136,20 ± 1,01407,89 ± 0,896,97 ± 0,0213394,84 ± 24,121435,91 ± 6,84252,15 ± 0,98118,97 ± 1,017274,22 ± 11,02577,83 ± 1,04GW60122,79 ± 0,50382,22 ± 1,217,28 ± 0,0112415,55 ± 33,211348,19 ± 9,50231,86 ± 1,21109,01 ± 2,457185,77 ± 6,81542,51 ± 0,76REDV original140,69 ± 2,15421,36 ± 8,165,98 ± 0,1115584,88 ± 33,861550,40 ± 3,46308,69 ± 3,48130,18 ± 2,387911,04 ± 15,01624,14 ± 1,24REDVD original99,22 ± 8,33374,45 ± 5,748,12 ± 0,188847,37 ± 76,841321,84 ± 2,62212,11 ± 1,1299,01 ± 1,226812,88 ± 21,66539,69 ± 4,18iREDVD21,45 ± 0,24286,85 ± 0,159,39 ± 0,116764,93 ± 27,111286,49 ± 0,96178,71 ± 1,5289,27 ± 0,696249,30 ± 25,21517,07 ± 0,59Improvement of iREDVD vs traditional.*Abs‐101,34‐95,372,11‐5650,62‐61,70‐53,15‐19,74‐932,27‐25,44%82,4624,9528,9845,514,5822,9218,1112,984,69Improvement of iREDVD vs REDV orig.Abs‐119,24‐134,533,41‐8829,95‐263,91‐128,98‐40,91‐1661,74‐107,07%84,7531,9357,0256,6217,0242,1131,4321,0117,15Improvement of iREDVD vs REDVD orig.Abs‐77,77‐87,601,27‐2082,44‐35,35‐33,39‐9,74‐563,58‐22,62Improvement %78,3823,3916,6423,542,6715,749,838,274,19*Improvement compared to the best traditional algorithm.Note: mean ± std of 10 testFor example, iREDVD achieves a reduction in average normalised waiting time of 34%–49% in Test 1 and 78%–82% in Test 2, compared to other control techniques, which is equivalent to reducing the average waiting time of vehicles at each intersection crossed by more than 6–100 seconds. This improvement is also reflected in other metrics, such as the reduction of average CO emissions by around 25% or a reduction in average fuel between 6%–17%.Final remarksThe excellent performance of the optimised iREDVD is due to the ability to adapt to road conditions. In Figure 10(a) and Figure 10(b), it can be observed how iREDVD proactively adapts the cycle time according to the simulated traffic flow rate, not only in training Figure 10(a), but also under unknown conditions in the testing Figure 10(b). The cycle increases when the flow rate of vehicles passing through the intersection increases, and vice versa. In addition, we can see in Figure 10(a) and Figure 10(b) that the time assigned to each branch is independent, since when the traffic is asymmetric [see Figure 10(b) hour 4] iREDVD obtains a time for each branch that is different, self‐adjusting to the traffic conditions. This independence allows iREDVD a better time distribution and a reduction of waiting time.10FIGUREVehicular flow rate simulated, time for each branch, and total cycle for intersection 1 vs. simulation time: (a) train scenario; (b) test scenarioCONCLUSIONWith the growing number of vehicles, the massification of large cities, and the rise of the IoT, there is a growing need for orchestration for smart cities. The use of ITS allows for efficient traffic control when applied to traffic‐light managed intersections, improving traffic flow, and diminishing pollution and fuel use. In this work, we have presented the optimisation and performance of the iREDVD ITS method using an EA as the optimisation tool. iREDVD is a lightweight, optimised, and fast algorithm capable of being executed in IoT devices with limited requirements, whereas controls signaled intersections in a very efficient way. We have shown that iREDVD outperforms traditional traffic light control techniques, as well as its different previous versions. Compared to traditional control techniques, it showed reductions between 24 to 100 seconds in the waiting time of vehicles per intersection travelled through, equivalent to between 50% and 80% reduction in the waiting time of vehicles at traffic lights. Compared to the original REDVD, iREDVD reduces the waiting time at each intersection by more than 27%. Besides, iREDVD reduces both the pollutant emissions and fuel consumption between 7%–38% and between 7%–14%, respectively.In conclusion, the use of these ITS will allow future smart cities to orchestrate the different actors better, allowing for increased safety, reduced pollution, and optimised transportation systems. As future work, research is being carried out on the integration of ITS with 5G technology, to obtain a fully connected traffic management ecosystem.ACKNOWLEDGEMENTSThis work was supported by the AEI/FEDER, UE project grant TEC2016‐76465‐C2‐1‐R (AIM), by DGT‐Ministerio del Interior, project grant SPIP2017‐02230 (STREET), and by a pre‐doctoral contract for the formation of the research personnel financed by Consejería de Empleo, Universidades, Empresa y Medio Ambiente of the CARM, through the Fundación Séneca ‐ Agencia de Ciencia y Tecnología of the Region of Murcia, Spain.REFERENCESZhang, J., et al.: Data‐driven intelligent transportation systems : A survey. IEEE Trans. Intell. Transp. Syst. 12 (4), 1624–1639 (2011)Murray, B.R.M., et al.: The Panel on Future Directions in Control, Dynamics, and Systems. IEEE Control Syst. Mag. 23 (2), 20–33 (2003)Cano, M.‐.D., et al.: Coordination and agreement among traffic signal controllers in urban areas. In: International Conference on Transparent Optical Networks, 2016.Singh, S.: Critical reasons for crashes investigated in the National Motor Vehicle Crash Causation Survey. Natl. Highw. Traffic Saf. Adm.1–2 (2015). Washington, DC. https://crashstats.nhtsa.dot.gov/Api/Public/ViewPublication/812115Pappis, C.P., Mamdani, E.H.: A fuzzy logic controller for a traffic junction. IEEE Trans. Syst. Man Cybern. 143(1–4), 73–97 (1977)Chiu, S., Chand, S.: Adaptive traffic signal control using fuzzy logic. In: 1st IEEE Regional Conference on Aerospace Control Systems, AEROCS 1993 ‐ Proceedings, 1993.Niittymäki, J., Pursula, M.: Signal control using fuzzy logic. Fuzzy Sets Syst., 16(1), 11–22 (2000)Dresner, K., Stone, P.: Multiagent traffic management: A reservation‐based intersection control mechanism. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2004.Vasirani, M., Ossowski, S.: A market‐inspired approach to reservation‐based urban road traffic management. In: Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, 2009.Li, Z., et al.: Modeling reservation‐based autonomous intersection control in VISSIM. Transp. Res. Rec. 2381 (1), 81–90 (2013)Wei, C.H.: Analysis of artificial neural network models for freeway ramp metering control. Artif. Intell. Eng. 15(3), 241–252 (2001)Srinivasan, D., Choy, M.C., Cheu, R.L.: Neural networks for real‐time traffic signal control. IEEE Trans. Intell. Transp. Syst. 7(3), 261–272 (2006)Tubaishat, M., Shang, Y., Shi, H.: Adaptive traffic light control with wireless sensor networks. In: 4th Annual IEEE Consumer Communications and Networking Conference, CCNC 2007.Mousavi, S.S., Schukat, M., Howley, E.: Traffic light control using deep policy‐gradient and value‐function‐based reinforcement learning. IET Intell. Transp. Syst. 11(7), 417–423 (2017)Genders, W., Razavi, S.: Evaluating reinforcement learning state representations for adaptive traffic signal control. Procedia Comput. Sci. 130, 26–33 (2018)Liang, X., et al.: A deep reinforcement learning network for traffic light cycle control. IEEE Trans. Veh. Technol., 68 (2), 1243–1253 (2019)Balaji, P.G., German, X., Srinivasan, D.: Urban traffic signal control using reinforcement learning agents. IET Intell. Transp. Syst. 4(3), 177–188 (2010)Arel, I., et al.: Reinforcement learning‐based multi‐agent system for network traffic signal control. IET Intell. Transp. Syst. 4(2), 128–135 (2010)Sánchez, J., Galán, M., Rubio, E.: Applying a traffic lights evolutionary optimization technique to a real case: ‘Las Ramblas’ area in Santa Cruz de Tenerife. IEEE Trans. Evol. Comput. 12(1), 25–40 (2008)Garcia‐Nieto, J., Olivera, A.C., Alba, E.: Optimal cycle program of traffic lights with particle swarm optimization. IEEE Trans. Evol. Comput. 17(6), 823–839 (2013)Segredo, E., et al.: Optimising Real‐World Traffic Cycle Programs by Using Evolutionary Computation. IEEE Access, 7, 43915–43932 (2019)McKenney, D., White, T.: Distributed and adaptive traffic signal control within a realistic traffic simulation. Eng. Appl. Artif. Intell. 26 (1), 574–583 (2013)Sanchez‐Iborra, R., Cano, M.‐.D.: On the similarities between urban traffic management and communication networks: Application of the random early detection algorithm for self‐regulating intersections. IEEE Intell. Transp. Syst. Mag. 9 (4), 48‐61 (2017)Sánchez‐Medina, J.J., Galán‐Moreno, M.J., Rubio‐Royo, E.: Traffic signal optimization in La Almozara District in Saragossa under congestion conditions, using genetic algorithms, traffic microsimulation, and cluster computing. IEEE Trans. Intell. Transp. Syst. 11(1), 132–141 (2010)Teo, K.T.K., Kow, W.Y., Chin, Y.K., Optimization of traffic flow within an urban traffic light intersection with genetic algorithm. In: Proceedings ‐ 2nd International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2010.Tang, C., et al.: Traffic signal phase scheduling based on device‐to‐device communication. IEEE Access 6, 47636–47645 (2018)Rehman, A., et al.: Vehicular traffic optimisation and even distribution using ant colony in smart city environment. IET Intell. Transp. Syst. 12 (7), 594–601 (2018).Jerry, K., et al.: NetLogo implementation of an ant colony optimisation solution to the traffic problem. IET Intell. Transp. Syst. 9(9), 862–869 (2015)Floyd, S., Jacobson, V.: Random early detection gateways for congestion avoidance. IEEE/ACM Trans. Netw. 1 (4), 397–413 (1993)Srinivas, M., Patnaik, L.M.: Genetic Algorithms: A Survey. Computer 27(6), 17–26 (1994)Roberge, V., Tarbouchi, M., Labonte, G.: Comparison of parallel genetic algorithm and particle swarm optimization for real‐time UAV path planning. IEEE Trans. Ind. Informatics 9 (1), 132–141 (2013)Chu, P.C., Beasley, J.E.: A genetic algorithm for the multidimensional knapsack problem. J. Heuristics 4(1), (1998)Viveros‐Jiménez, F., León‐Borges, J.A., Cruz‐Cortés, N.: An adaptive single‐point algorithm for global numerical optimization. Expert Syst. Appl. 41 (3), 877–885 (2014)Toǧan, V., Daloǧlu, A.T.: An improved genetic algorithm with initial population strategy and self‐adaptive member grouping. Comput. Struct. 86 (11–12), 1204‐1218 (2008)Johnson, J.M., Rahmat‐Samii, Y.: Genetic algorithms in engineering electromagnetics. IEEE Antennas Propag. Mag. 39 (4), 7–21 (1997)Venkatesh, P., Gnanadass, R., Padhy, N.P.: Comparison and application of evolutionary programming techniques to combined economic emission dispatch with line flow constraints. IEEE Trans. Power Syst. 18 (2), 688–697 (2003)Dip, G., Prahlad, V., Kien, P.D.: Genetic algorithm‐based optimal bipedal walking gait synthesis considering tradeoff between stability margin and speed. Robotica, 27(3), 355–365 (2009)Goldberg, D.E., Deb, K.: A comparative analysis of selection schemes used in genetic algorithms. 1, 69–93 (1991)Blickle, T., Thiele, L.: A comparison of selection schemes used in evolutionary algorithms. Evol. Comput. 4 (4), 361–394 (1996)Jebari, K., Madiafi, M.: Selection methods for genetic algorithms. Int. J. Emerg. Sci. 3, 333–344 (2013)Reeves, C.R., Rowe, J.E., Genetic Algorithms—Principles and Perspectives. Kluwer, Boston (2002)Kora, P., Yadlapalli, P.: Crossover operators in genetic algorithms: A review. Int. J. Comput. Appl. 6 (1), (2017)Magalhães‐Mendes, J.: A comparative study of crossover operators for genetic algorithms to solve the job shop scheduling problem. WSEAS Trans. Comput. 12 (4), 164–173 (2013)Spears, W.M., Anand, V.: A study of crossover operators in genetic programming. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer, Berlin (1991)Abdoun, O., Abouchabaka, J.: A comparative study of adaptive crossover operators for genetic algorithms to resolve the traveling salesman problem. 31 (11), 975–8887 (2012)Srinivas, M., Patnaik, L.M.: Adaptive probabilities of crossover and mutation in genetic algorithms. IEEE Trans. Syst. Man Cybern. 24 (4), 656–667 (1994)Back, T.: Optimal mutation rates in genetic search. In: Proceedings of the fifth international conference on genetic algorithms, 1993, 2–8.Hesser, J., Mäinner, R.: Towards an optimal mutation probability genetic algorithms. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer, Berlin (1991)Greenwell, R.N., Angus, J.E., Finck, M.: Optimal mutation probability for genetic algorithms. Math. Comput. Model. 21(8), 1–11 (1995)Lopez, P.A., et al.: Microscopic Traffic Simulation using SUMO. In: 21st International Conference on Intelligent Transportation Systems (ITSC), 2018, 2575–2582.“Madrid's Vehicle Fleet. 2018. [Online]. Available: bit.ly/2svNegb. Accessed: 28 Apr. 2019.Borge, R., et al.: Comparison of road traffic emission models in Madrid (Spain). Atmos. Environ. 62, 461–471, (2012)Yu, C., et al.: Integrated optimization of traffic signals and vehicle trajectories at isolated urban intersections. Transp. Res. Part B Methodol. 112, 89–112 (2018)Dresner, K., Stone, P.: Multiagent traffic management: An improved intersection control mechanism. In: Proceedings of the International Conference on Autonomous Agents, (2005).Salman, M.A., Ozdemir, S., Celebi, F.V.: Fuzzy traffic control with vehicle‐to‐everything communication. Sensors (Switzerland), 18 (2), Art. no. 368 (2018)Almeida, A.M.R., Macedo, J.A.F., Machado, J.C., Optimization of urban semaphore times turning into JSSP. In: CEUR Workshop Proceedings, 2018.Mousavi, S.S., Schukat, M., Howley, E.: Traffic light control using deep policy‐gradient and value‐function‐based reinforcement learning. IET Intel. Transport Syst. 11(7), 417–423 (2017) http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png IET Intelligent Transport Systems Wiley

Intelligent IoT systems for traffic management: A practical application

Loading next page...
 
/lp/wiley/intelligent-iot-systems-for-traffic-management-a-practical-application-HpgEkvSIJ0

References (59)

Publisher
Wiley
Copyright
© 2021 The Institution of Engineering and Technology
eISSN
1751-9578
DOI
10.1049/itr2.12021
Publisher site
See Article on Publisher Site

Abstract

INTRODUCTIONIntelligent Transportation Systems (ITS) are witnessing a paradigm shift in their modus operandi. Intelligence, understood as the capability of a system to assist, manage, and make decisions to improve key performance metrics, is entering a new reality. One of the reasons lies in prompt advances in traffic data acquisition [1]. This increasing amount of available data allows better and more responsible use of resources. And the origin of these data is vast. Still, in general, we can affirm that either from sensor devices or from Road Side Units (RSU), in‐vehicle communication, Vehicle‐to‐Vehicle (V2V), Vehicle‐to‐Infrastructure (V2I), Vehicle‐to‐Everything (V2X), etc. is being transformed thanks to IoT devices. These IoT devices can be embedded in practically any system or device, offering the ability to communicate, provide information, or automate tasks. Besides, these IoT devices have low energy consumption, which allows them to be integrated into battery‐powered devices, thus extending their useful life.Traffic management, as a part of traffic control, represents a relevant segment of the studies carried out in ITS. The Panel on Future Directions in Control, Dynamics, and Systems identified in [2] several challenges for control systems with a straightforward application in ITS: the notion of operating in a distributed system and the need for coordination and autonomy. Currently, ITS are distributed by nature; hence an effort is needed to visualize the system as such [3]. A distributed system is usually defined as a set of independent computing elements whose hardware and software components communicate and coordinate their actions only through passing messages, appearing to the final user as a whole system (transparency). Among others, design challenges in distributed systems include performance, robustness, and reliability, with emphasis on achieving these features in the context of communication within ITS. Regarding coordination and autonomy, Murray et al. explain in [2] that the study and development of robust control systems require more elaboration in order to realize higher level decision‐making systems. The benefits of such decisions are enormous: to enhance the efficiency of the vehicle transportation system by improving safety, reducing pollution (particle and noise), decreasing fuel consumption, and increasing service quality (e.g. with shorter vehicles’ travel and waiting times). For instance, according to the National Highway Traffic Safety Administration data [4], incorrect driver decisions account for 33% of accidents, being driver recognition the top one reason (41%). These figures are expected to be significantly reduced with proper processing of collected data, and an adequate system‐intelligence applied.For intelligent and autonomous traffic light control at intersections, researchers have explored a wide range of approaches. The general methods that have been used to create smart, autonomous control systems for signaled intersections have been diverse, such as fuzzy logic [5–7], reservation and market‐based system [8–10], neural networks [11–13], reinforcement learning [14–18], and swarm intelligence and evolutionary computation [19–21].The main problem with most of these algorithms is that they are complex to run in real‐time on IoT devices, demanding the resources of cloud computing [22]. But with the rise of Smart Cities, autonomous cars, 5G, and ITS, using the advantages offered by IoT (integration, embedded, speed, simplicity, etc.) can be a definite advantage for the development of intelligent traffic light control systems. In this paper, we face the problem of combining IoT with intelligent algorithms. Random Early Detection for Vehicles Dynamic (REDVD) is a very simple and efficient mechanism that optimizes the phases and cycles of traffic lights in signaled intersections [23]. However, REDVD outperforms other mechanisms in a stand‐alone intersection, it does not behave as expected in more complex scenarios. The reason lies in the number of configuration parameters that influence its performance. Therefore, we propose to use an evolutionary algorithm (EA) to obtain the best parameter configuration of REDVD. Then, the performance of this new enhanced version, called improved REDVD (iREDVD), is tested under complex and previously‐unknown traffic scenarios. The Key Performance Indicators (KPI) used in this study are: waiting time, trip time, travel speed, emissions (specifically, CO, CO2, HC, PMx, and NOx), and fuel consumption. In the light of our outcomes, iREDVD is a light‐by‐design method that can be deployed in IoT devices and outperforms not only REDVD but also other well‐known traffic management methods in all the KPI under study.The rest of the paper is organised as follows. In Section 2, we briefly summarise the state of the art. Section 3 describes our proposal. In Section 4, the optimization process is detailed. The methodology followed for the performance evaluation is explained in Section 5. In Section 6, we show and discuss the obtained results. The paper ends summarising the most important outcomes.RELATED WORKSWithin evolutionary computation, there are two main branches. First, we have those that use genetic algorithms (GAs) to represent the phases and timing of traffic lights as a set of chromosomes and perform optimisation directly on these chromosomes (could also be included within the fuzzy logic). Second, we have those that use GA as an optimisation algorithm to optimise a traffic light control system with a large number of parameters, which is unfeasible to optimise otherwise. The main advantage of these algorithms is that they are light to run but require a large computational load for training or for adjusting their parameters. However, this disadvantage can be mitigated by working in the cloud during a training phase and updating the algorithm easily once finished.Within the first group, it stands out the work carried out by Sánchez et al. [24], where GA is used to encode a fuzzy logic controller in the chromosomes and find the optimal parameters according to the number of cars waiting. Also, similar works on optimising traffic light timings using GA are presented in [21], [25]. An approach linking GAs with device communication (D2D) can be found in [26]. This D2D approach is used to collect information from sensors and actuators, as well as to try to reduce the response time.Concerning the second group, it is within this one that our work is encompassed, since we use a GA to optimise the traffic light control algorithm. The advantage of this method is that it allows us to shift the complexity of the search for the parameters of the control algorithm to a training scenario. This allows us to adapt the control algorithm to numerous scenarios and changes in the environment and act in advance to planned changes to be able to study the scenario beforehand. Other works use optimisation algorithms similar to GAs, such as ant colony optimization (ACO) algorithms. An ACO algorithm is a probabilistic technique for solving computational problems that can be reduced to find good paths through graphs. Within this category, the work done by Abdul Rehman et al. [27] and Kponyo Jerry et al. [28] stand out, where both solve the traffic congestion problem applying ACO.Differing from previous works in the existing literature, our work focuses on optimising an adaptive traffic light control algorithm known as REDVD by means of a GA. Our improved version of this algorithm is called iREDVD. The novelty lies in the use of GAs for the optimisation process, facilitating the task of setting the best values for the numerous configuration parameters that the algorithm has. To the best of our knowledge, whereas most works from the related literature include GAs in traffic management procedures themselves, we only use it as an offline step. Then, we demonstrate that once the parameters are set, iREDVD works successfully under unknown traffic scenarios keeping its lightness feature and without the burden of AI. Therefore, it is possible to use it in IoT devices.RANDOM EARLY DETECTION FOR VEHICLESAs seen in the previous section, there are several approaches to traffic light controlled intersection control algorithms. In this section, we will show the basis of the ITS proposed in [23], namely REDVD, which serves as the basis for this work.The operating principle of REDVD is the Random Early Detection (RED) [29] algorithm, used for congestion control in communications networks. RED is a congestion control technique extensively used in packet‐switching communication networks. It is used to detect and avoid congestion and delay at an early stage and to improve the throughput rate. RED avoids congestion by randomly discarding packets, depending on the number of packets waiting to be processed in the router. These packets can be dropped if the packet queue to be processed exceeds a certain threshold; the dropping probability increases as the queue size increases. By doing so, congestion can be avoided even before it occurs. For more information about RED, please refer to [29].Note that the concept of a queue at a router and a queue at a traffic light in a signalled intersection are analogous. As proposed in [23], the analogy with intersections occurs when, instead of discarding packets, we increase the green‐time when many vehicles are queuing to cross an intersection. With this, we can get more vehicles crossing the intersection. The first adaptation of RED for traffic control in urban scenarios was introduced in [23] and was called RED for Vehicles (REDV). REDV adapts the RED algorithm to the particularities of vehicles and signalled intersections. The idea was as follows. In REDV, the number of vehicles waiting to cross at each intersection branch is similar to the number of packets queued for processing in a router. Instead of dropping a packet with a certain probability that depends on the number of packets to be processed, what REDV does is to increase or decrease the green time of each intersection branch with a probability that depends on the number of cars waiting to cross. Thus, when there are many cars on a branch, the green time for that branch increases frequently. Otherwise, when there are too few cars waiting to cross, the green time is likely reduced. In any case, the total cycle of the intersection was a fixed value. REDV demonstrated significant performance as an adaptive control system, managing incipient congestion at isolated intersections proactively.To deal with changes in the traffic flow rate i.e. in the number of vehicles crossing the intersection per unit of time, it was necessary to adapt the total cycle. The idea was to provide long cycles to accommodate a large volume of traffic and short cycles when the volume of traffic was small. Consequently, the same authors introduced in [23] a new version called REDV Dynamic (REDVD). Although the performance in terms of traffic metrics was improved, this adaptation added new adjustable parameters to the configuration, parameters that need to be appropriately tuned for the algorithm's optimal performance. Table 1 includes all the configuration parameters of REDVD.1TABLEREDVD parametersParameterDescriptionminthThe minimum threshold of vehicles to increase the green‐time phases probabilistically.maxthThe maximum threshold of vehicles to increase the green‐time phases probabilistically.deltaThe increase/decrease of the green‐time phase, either because it is above/below the maxth/minth thresholds, respectively or because of a probabilistic increase when it is between these thresholds.min_greentimeThe minimum time for a green‐time phase.max_greentimeThe maximum time for a green‐time phase.limincNumber of consecutive times the green‐time phase must be increased, taking into account both arteries to increase the cycle for all phases.limdecNumber of consecutive times the green‐time phase must be decreased, taking into account both arteries to decrease the cycle for all phases.delta_cycleThe increase/decrease of the cycle for all phases.min_cycleThe minimum cycle for all phases.max_cycleThe maximum cycle for all phases.wqFactor that determines the weight of the historical value versus the current value in the queue length.maxpFactor used in RED algorithm.These new parameters contribute to higher flexibility in traffic control, being adaptive to continuously changing traffic conditions. However, the process of optimising this large set of parameters is not trivial. Indeed, the values used in [23] for configuring REDVD were obtained empirically. For this reason, one of the goals of this work is to find out the optimal configuration able to offer the best performance in terms of the average waiting time per intersection for each vehicle. This optimisation will be carried out using an EA, more specifically, employing a GA, which is the most popular type of EA. This version optimised by means of a GA is called iREDVD. The entire optimisation process is described in the following section.OPTIMISATION PROCESSA GA consists of a series of mechanisms inspired by the theory of evolution to optimise a set of parameters within a problem. It allows us to optimise that problem, a priori complicated, using a set of light procedures with a low computation load, which derives in a fast parameter optimisation [30], [31].The general procedure consists of four phases: initialize population, fitness calculation, selection, and crossover, which form a generation. The overall optimisation process will take as many generations as needed. A population is a set of individuals. The number of genes that each individual contains is determined by the number of parameters to be optimized. In our case, the parameters are included in Table 1. And each gene contains the coded information of each parameter; e.g. in our case, the maxp parameter will be a float number between 0 and 1. As an example, in a problem with six binary parameters to optimise (such as the knapsack problem [32]), an example of population would be the one shown in Figure 1. In this figure, it can be seen a population of four individuals (I1 to I4), each one with six parameters to optimise (P1 to P6), and each of these parameters is coded in binary (0 or 1) in each gene.1FIGUREExample of a population of a GA, which consists of four individuals and six binary parameters to be optimisedFor each generation, the best‐adapted individuals to the environment are selected so that the next generation individuals contain their genes. The meaning of “better adapted to the environment” is given by the fitness function, which numerically measures how well an individual can adjust to the problem being optimised. For example, in tasks of humanoid robots learning to walk where genes encode the movement of joints and muscles, the fitness function will be the distance travelled, and those individuals who have gone the furthest will be most likely to be selected. Next, we describe the four phases of the general optimisation procedure in detail (see Figure 2).2FIGUREA general GA procedure. The convergence condition can be different, either several entire generations or a fitness value improvement after a few generationsInitialize populationThe procedure starts by initialising the population of individuals with random genes (although other approaches can be followed in order to reduce the convergence time [33], [34]). The population's size is a crucial factor for the correct convergence of the GA because it causes more diversity in the population and ensures that a large amount of the parameter space is being explored.Nevertheless, there is an important trade‐off between the population's size, genetic diversity, and convergence speed. If the population size is considerable, then we have great genetic diversity by exploring a wide range of the parameter space at random. Still, it will take a long time to get the fitness of all the individuals in the population. The size of the population depends strongly on the problem studied as to cover a considerable range of the parameters, it is necessary to increase the population's size exponentially. A basic (minimum) recommendation is to have twice more individuals than the parameters studied. Typical population sizes can be between 20 and 100 individuals [30], [35], [36].Fitness calculationEach individual in the population is evaluated, and its level of fitness is obtained with its parameter settings. For instance, if we are designing a robot that is able to walk, the fitness of each individual would be how far it can walk [37], or in the knapsack problem [32] the adjustment would be the benefit obtained by the objects inside the knapsack that minimize the weight. In our case, it will be the average waiting time per intersection for each vehicle.SelectionOnce all the individuals in the population have been evaluated, the next step is to select the subset of individuals that are best‐adapted to the environment. There are various mechanisms for selecting the best individuals [38–40] e.g. roulette wheel selection, Boltzmann selection, tournament selection, rank selection, steady‐state selection, or Elitism selection, among others. One of the simplest and most effective methods is to select the percentage of individuals who have the highest level of fitness. With this mechanism, the subset of best‐adapted individuals to the environment will be obtained in the current generation. These selected individuals are known as parents and will be the ones from whom the children are generated.The percentage of individuals selected is crucial for the correct functioning of the GA. A very high value will prevent rapid convergence, but a very low value will nevertheless lead to low genetic diversity and a high possibility of converging towards a local minimum. Typical values are between 5% and 10%, depending on the population size [30], [35], [36]. The remaining individuals (children) will be generated from these parents to obtain the total population employing crossover and mutation mechanisms.Although there are very sophisticated methods for the selection mechanism, it is recommended [41] to use rank selection in the first generations to quickly obtain preliminary results. More sophisticated techniques can be found in [40].CrossoverCrossover is a mechanism by which new individuals (children) are generated from two or more parents’ direct mating. This mechanism is done to obtain children with the genes of those parents with a good fitness value. The basic operation of this mechanism consists of randomly selecting genes from different parents to generate new children. There are various crossover mechanisms, such as the selection of a single crossover point, the selection of multiple crossover points, or the gene‐by‐gene combination [42–45]. The mechanism used in this work is the single crossover point due to its simplicity and good results [44]. In this mechanism, a crossing point is selected. Then, from the beginning of the genes to the crossing point is transferred from one parent and the rest is transferred from the second parent.The crossover rate indicates how often parents are crossed over to get new children, so it controls what percentage of the children are exact copies of the parents and which ones are crossed over. That is, if the crossover rate is 100%, all the children are obtained by crossing the parents. However, if the crossover rate is 0%, all the children are exact copies of the parents. This 0% does not mean that the new generation will be identical to the parents since, after the crossover, the genes of these created children can be mutated. The crossover rate should generally be high, between 80% and 95% [30], [35], [36].MutationIn order to obtain a high genetic diversity, mutation mechanisms can be applied to a small percentage of genes in the population. This mutation means that the value of each gene can vary randomly with a small probability. The mutation prevents the GA from converging to a local optimum, as it allows other combinations of genes that can achieve better fitness levels to be explored at random [46], [47]. However, this mutation should not occur very often, since then it would become a random search.The mutation rate indicates how often an individual's gene will be mutated. If the mutation rate is 0%, the population created after the crossover does not mutate. However, if the mutation rate is 100%, all the offspring's genes are changed randomly, within a range of change, which depends on the scenario. This margin of change should be less than 20% of the allowed range for each gene [47]. In turn, the mutation rate should remain around 0.1% and 1%, to enable the exploration of a wider space of parameters, obtaining genetic diversity, but without falling into exploring the entire parameter space at random [48], [49]. In addition, it is advisable to decrease the mutation rate, as well as the margin of change of the parameters, as the generations go by, in order to obtain a rapid convergence [30], [35], [36]. Figure 2 shows the general procedure followed by the GA until the convergence condition is satisfied.For iREDVD, we choose a population size of 256 individuals, 16 selected parents, a crossover rate of 80%, a mutation rate of 0.1%, and up to 20 generations.MATERIALS, SCENARIOS, AND METHODSThe algorithms under study have been evaluated via computer simulations. All methods were programmed in Python v3.7. To compare the different proposals, these scripts were coupled to the microscopic traffic simulator SUMO [50] v1.0.1 with TraCI (Traffic Control Interface). SUMO is a valuable tool as it allows flexible simulations in customizable scenarios and editable traffic patterns. The CPU used was an Intel Xeon 16 cores @ 2.6GHz. Three scenarios were used in this work, one for training and two for testing. The details of each scenario can be seen in Table 2. These three scenarios share some characteristics and differ in others, which are described in the next subsections.2TABLECharacteristics of the training and the testing scenariosScenarioNumber of intersectionsIntersection layoutDistance between intersectionsSimulation durationTrain164×4300m10hTest151×5200m12hTest210010×10250m12hOn the one hand, the training scenario consists of a grid of 4×4 intersections (in total 16 intersections) with 2 lanes for each direction, and a distance between intersections of 300m (see Figure 3). The vehicle flow rate fluctuated over time, remaining constant for one hour and following the pattern shown in Figure 4 afterward. The low flow rate was 600 vehicles/hour/branch, the medium flow rate 1200 vehicles/hour/branch, and the high flow rate 1600 vehicles/hour/branch. The north (N) and south (S) branches had identical flow rates, and the same applies to the east (E) and west (W) branches. As shown in Figure 4, there are times when the flow rate is symmetric (e.g. hours 0, 1, and 2) and others when is asymmetric (e.g. hours 3, 4, and 5).3FIGURESimulated topology for the training scenario: Manhattan 4×4 network with 300 m between each intersection4FIGUREVehicle flow rate per branch used in the training scenarioOn the other hand, the testing scenarios are used to evaluate the performance in new conditions i.e. situations the algorithm has not been trained for. Simulations run in the testing scenarios use the optimal parameters obtained previously in the training. The test scenario 1 consists of a large avenue of 5 intersections (1×5) separated by 200 m between each intersection, as shown in Figure 5(a). The test scenario 2 consists on a 10×10 Manhattan grid, with a total of 100 intersections, as can be seen in Figure 5(b).5FIGURESimulated topology for the testing scenarios: (a) 1×5 network with 200m between each intersection; (b) Manhattan 10×10 network with 250m between each intersectionBoth test scenarios have the same vehicle flow rate, depicted in Figure 6. The vehicle flow rate is more chaotic than in the training scenario, hence, stressing the different algorithms under study. As depicted in Figure 6, the constant vehicle flow rate stretches are maintained for 15 min, and the low, medium, and high flow rates are 700, 1000, and 1800 vehicles/hour/branch, respectively. With these test scenarios, we check that iREDVD algorithm is not overfitted to the training scenario.6FIGUREVehicle flow rate per branch used in the testing scenariosIn all scenarios, the selected vehicle distribution was based on the fleet of vehicles of the city of Madrid (Spain) [51]. This distribution can be seen in Table 3. Also, in order to obtain both fuel consumption and pollutant emissions from vehicles, the pollution model used was the HBEFA [52]. HBEFA is an emission factor database for road vehicles and provides emission factors (hot start, cold start, evaporation) for all regulated and important unregulated air pollutants as well as for fuel consumption.3TABLEType of vehiclesVehiclePercentageFuelCar30%GasolineCar40%DieselMotorcycle10%GasolineMoped10%GasolineVan5%DieselBus5%Average of all fuel typesSimilarly, each intersection had four traffic lights that control each branch of the intersection (North, South, East, West) in all scenarios. Each intersection had two lanes on each branch. An example of an intersection is depicted in Figure 7. For simplicity, only the straight forward and the right‐turn movements were allowed (see Figure 7). This constrain can be found in numerous articles [53–57]. Mainly, it allows a quickly/simple implementation of a new concept to study the advantages and disadvantages it can offer. When adding the left turn, only one new phase needs to be incorporated in the whole cycle, thus allowing the vehicles to turn left. Therefore, we consider that the selected configuration is appropriate to show the benefits of the optimization process using the GA and the robust performance of iREDVD. The phases of the traffic lights were coupled (North with South and East with West) and were opposite (North and South vs. East with West) between each couple. In other words, if the traffic lights in the North and South branches were green, it means that East and West branches were red. Figure 8 illustrates an example of a cycle. The yellow‐time interval of the traffic lights was set to 3 seconds and the all‐red time to 2 seconds, for safety and realism. The safety distance used between cars was 1.5 m. A Poisson distribution was used to generate the traffic flows.7FIGUREAn example of a four‐way signaled intersection, as used in this study8FIGUREExample of a cycle in a traffic light. N≡North; S≡South; W≡West; E≡EastIn order to compare the performance of iREDVD with other algorithms for traffic control, we also implement and evaluate the performance of a system with fixed cycle (FX) time and with pre‐set offsets between consecutive intersections, known as Green Wave (GW). This offset is obtained as the time that a vehicle needs to move between two intersections once the green phase starts (i.e. in a coordinated manner like a green wave). Three different versions were studied for the FX and GW algorithms, which also modify the total cycle time. The cycle times of 30, 45, and 60 seconds were studied. The nomenclature used for each algorithm is expressed as the algorithm used and the cycle time; so, for example, FX45 corresponds to the fixed algorithm with a cycle time of 45 seconds, equally divided between the branches. In all cases, 10 experiments were run for each algorithm under study in each scenario, obtaining the mean value and its standard deviation.Finally, the optimised metric was the normalised waiting time (the waiting time per intersection), although trip time, average speed, CO, CO2, HC, PMx and NOx emissions, and fuel consumption were also studied. The normalised waiting time eliminates the influence of the number of intersections crossed by vehicles in each flow. Therefore, we obtain for each vehicle the ratio between the total waiting time and the number of intersections crossed during the travel.PERFORMANCE EVALUATION RESULTSTraining scenarioFigure 9 illustrates the value of the normalised waiting time of the vehicles per intersection during the optimisation process carried out by the GA in the training phase. As it is shown, after about five generations, the GA achieves promising values for most of the simulated individuals. Although the simulation could have been stopped at that time, we left it more to find a robust set of parameters, which over time are able to demonstrate their superiority over the rest. The time required for each of the simulations in the optimisation process for the test scenario was approximately 1 min. This means that simulating the 256 individuals for 21 generations (the initial one plus the next 20 generations) on a server with 16 processing cores (i.e. 16 simulations were simulated in parallel) required approximately 48 hours. Each simulation was repeated 10 times, and the mean value and standard deviation of each variable under study were obtained. Once the best parameter configuration has been obtained, the execution of the traffic light control algorithm is almost instantaneous since it no longer requires any artificial intelligence algorithm since it is based on a light, fast, and quick method that can be rapidly executed by low power IoT devices. Further information can be found in [23].9FIGUREOptimization process. “Best fitness” is the individual with the lowest normalised waiting time and “Average fitness” is the average normalised waiting time of all population. X‐Axis represents the generationsThe values of the optimised parameters after training are shown in Table 4. The optimisation process shows very interesting values. The parameter limdec indicates how many times the green time of the traffic lights is reduced so that the total cycle of the intersection is reduced. The fact that limdec is 1, means that if a reduction in traffic is detected, even a slight one, it is worthwhile to shorten the total cycle time (and therefore the green time) quickly. However, the opposite is true for the increase in total cycle time. The parameter liminc controls the increase in the total cycle time of the traffic lights when traffic increases. With a high value of 5, it means that iREDVD should be very sure before increasing the total cycle time.4TABLEOptimised parameters and valuesParameterOptimal valueminth10maxth25min_greentime15max_greentime60min_cycle45max_cycle130delta10delta_cycle15maxp0.85wq0.7limin5limdec1The results obtained in the training scenario (see Table 5) corroborate that iREDVD has an improvement of more than 50% (reduction in almost 25 seconds) in the average waiting time of the vehicles per intersection when compared to the traditional algorithms (Fixed 30, Fixed 45, Fixed 60, GreenWave 30, GreenWave 45, and GreenWave 60).5TABLETrain resultsAlgorithmAvg. norm. waiting time (s)Avg. trip time (s)Avg. speed (m/s)Avg. CO emissions (mg)Avg. CO2 emissions (g)Avg. HC emissions (mg)Avg. PMx emissions (mg)Avg. NOx emissions (mg)Avg. Fuel consumption (ml)TraditionalFX3051,5 ± 0,11213,96 ± 0,237,25 ± 0,016642,68 ± 21,51783,90 ± 4,49131,83 ± 1,0862,75 ± 0,563988,72 ± 31,07315,59 ± 1,79FX4548,3 ± 0,04201,81 ± 0,097,61 ± 0.026184,13 ± 24,62739,80 ± 3,07122,93 ± 0,2258,39 ± 0,143749,9 ± 15,14297,69 ± 1,11FX6051,64 ± 0,06204,76 ± 0,107,53 ± 0.016463,22 ± 50,19739,23 ± 2,78124,46 ± 1,4258,87 ± 0,693754,17 ± 26,77297,47 ± 1,19GW3046,13 ± 0,08203,29 ± 0,117,65 ± 0.046016,83 ± 36,41751,81 ± 2,05122,43 ± 0,3858,54 ± 0,163788,61 ± 14,89302,59 ± 0,71GW4550,02 ± 0,04206,29 ± 0,077,64 ± 0.016427,96 ± 20.95748,24 ± 2,40125,86 ± 0,8959,63 ± 0,433796,72 ± 19,92301,2 ± 0,99GW6047,4 ± 0,07199,45 ± 0,077,82 ± 0.026178,43 ± 18,74725,66 ± 5,34120,27 ± 1,0656,98 ± 0,523665,26 ± 34,72291,95 ± 2,11REDV original62,07 ± 4,01234,22 ± 7,617,22 ± 0,097560,28 ± 363,68819,66 ± 15,15143,19 ± 5,0267,75 ± 2,214221,82 ± 96,234221,82 ± 96,23REDVD original29,58 ± 0,33184,06 ± 0,678,27 ± 0,034708,8 ± 27,36723,41 ± 1,67107,39 ± 1,0052,51 ± 0,513573,78 ± 16,833573,78 ± 16,83iREDVD21,37 ± 1,86167,2 ± 2,919,14 ± 0,164074,01 ± 135,67672,11 ± 10,1396,01 ± 2,3546,96 ± 1,153275,5 ± 59,963275,5 ± 59,96Improvement of iREDVD vs traditional.*Abs‐24,76‐32,251,32‐1942,82‐53,54‐24,27‐10,02‐389,76‐21,83%53,6716,1616,8732,287,3720,1817,5810,637,47Improvement of iREDVD vs REDV orig.Abs‐40,7‐67,021,92‐3486,27‐147,54‐47,19‐20,79‐946,32‐59,92%65,5728,6126,5946,1118,0132,9630,6822,4218,15Improvement of iREDVD vs REDVD orig.Abs‐8,21‐16,860,87‐634,79‐51,29‐11,39‐5,55‐298,28‐20,85Improvement %27,759,1610,5113,487,0910,6010,568,347,16*Improvement when compared to the best traditional algorithm.Note: mean ± std of 10 testCompared with the original REDV algorithm proposed in [23], our improvement is still visible. This reinforces the fact that adjusting the parameters is necessary for the proper functioning of the algorithm. Finally, compared to the original REDVD variant also presented in [23], there is an improvement of 27%, which means a reduction of more than 8 seconds in the average normalised waiting time. Regarding other key performance metrics, such as average travel time, average speed, average CO, CO2, HC, PMx and NOx emissions, and average fuel consumption, we can see that the iREDVD improves in all of these KPI in the range of 7%–45%; most notably an increase in speed of 16%–26%, a decrease in average CO emissions of 13%–46%, and a decrease in average fuel consumption of 7%–18%, compared to traditional algorithms.Testing scenariosAfter the training, we stressed the algorithm in the testing scenarios using the optimised values for the configuration parameters. It can be seen from the results shown in Tables 6 and 7 that improvements are very similar to those obtained in the training scenario. This corroborates that iREDVD is able to adjust to unknown conditions (scenarios that it was not trained for), showing its robustness to be applied in real deployments.6TABLETest 1 resultsAlgorithmAvg. norm. waiting time (s)Avg. trip time (s)Avg. speed (m/s)Avg. CO emissions (mg)Avg. CO2 emissions (g)Avg. HC emissions (mg)Avg. PMx emissions (mg)Avg. NOx emissions (mg)Avg. Fuel consumption (ml)TraditionalFX3021,22 ± 0,25169,9 ± 1,217,34 ± 0,025395,04 ± 65,59597,56 ± 4,11102,91 ± 0,7348,99 ± 0,323073,18 ± 22,29240,61 ± 1,6FX4519,2 ± 0,03150,47 ± 0,167,72 ± 0,014756,42 ± 23,49551,99 ± 1,5492,4 ± 0,2744,06 ± 0,152814,95 ± 5,68222,12 ± 0,58FX6021,74 ± 0,01153,05 ± 0,087,58 ± 0,025047,54 ± 20,13555,22 ± 3,0295,51 ± 0,845,26 ± 0,422844,12 ± 23,67223,56 ± 1,17GW3017,68 ± 0,39154,05 ± 1,797,81 ± 0,044586,92 ± 69,72558,36 ± 3,4991,28 ± 0,9143,83 ± 0,412829,06 ± 20,56224,73 ± 1,4GW4519,25 ± 0,03151,67 ± 0,157,83 ± 0,054803,28 ± 19,67554,43 ± 3,2792,62 ± 0,7444,21 ± 0,382825,08 ± 21,72223,09 ± 1,28GW6021,36 ± 0,02151,66 ± 0,127,75 ± 0,034919,4 ± 18,56555,01 ± 3,0993,9 ± 1,1944,67 ± 0,552839,66 ± 22,56223,32 ± 1,19REDV original25,38 ± 0,71152,55 ± 2,947,73 ± 0,054663,73 ± 120,77547,49 ± 5,1591,42 ± 1,9143,61 ± 0,862789,18 ± 34,95220,32 ± 2,14REDVD original17,66 ± 0,47172,52 ± 2,997,49 ± 0,084771,9 ± 84,21580,91 ± 7,7095,48 ± 1,8645,93 ± 0,912953,28 ± 51,28233,81 ± 3,08iREDVD11,6 ± 0,27132,6 ± 1,658,38 ± 0,063406,27 ± 66,39511,34 ± 3,0777,09 ± 1,5637,62 ± 0,72536,01 ± 23,01205,74 ± 1,36Improvement of iREDVD vs traditional.*Abs‐6,08‐17,870,55‐1180,65‐40,64‐14,19‐6,21‐278,94‐16,38%34,3811,877,0225,737,3615,5414,169,917,37Improvement of iREDVD vs REDV orig.Abs‐13,78‐19,950,65‐1257,46‐36,14‐14,33‐5,99‐253,17‐14,58%54,2913,078,4026,966,6015,6713,739,076,61Improvement of iREDVD vs REDVD orig.Abs‐6,06‐39,920,89‐1365,63‐69,57‐18,39‐8,31‐417,27‐28,07Improvement %34,3123,1311,8828,6111,9719,2618,0914,1312,0*Improvement compared to the best traditional algorithm.Note: mean ± std of 10 test7TABLETest 2 resultsAlgorithmAvg. norm. waiting time (s)Avg. trip time (s)Avg. speed (m/s)Avg. CO emissions (mg)Avg. CO2 emissions (g)Avg. HC emissions (mg)Avg. PMx emissions (mg)Avg. NOx emissions (mg)Avg. fuel consumption (ml)TraditionalFX30139,05 ± 0,04428,63 ± 0,436,49 ± 0,0114273,45 ± 55,481508,16 ± 10,14266,46 ± 1,25125,60 ± 0,337738,98 ± 7,48608,09 ± 0,87FX45131,08 ± 0,07400,10 ± 0,856,85 ± 0,0213064,96 ± 44,841415,19 ± 4,25246,19 ± 0,84116,05 ± 1,157274,22 ± 10,44569,66 ± 2,15FX60134,60 ± 0,21399.23 ± 0,376,90 ± 0,0513388,89 ± 14,351396,32 ± 7,19246,41 ± 3,44115,58 ± 0,887181,57 ± 8,21561,98 ± 1,49GW30122,31 ± 0,18403,24 ± 0,596,69 ± 0,0512634,05 ± 43,011443,93 ± 13,25245,45 ± 2,25116,67 ± 0,917738,98 ± 4,97581,48 ± 0,45GW45136,20 ± 1,01407,89 ± 0,896,97 ± 0,0213394,84 ± 24,121435,91 ± 6,84252,15 ± 0,98118,97 ± 1,017274,22 ± 11,02577,83 ± 1,04GW60122,79 ± 0,50382,22 ± 1,217,28 ± 0,0112415,55 ± 33,211348,19 ± 9,50231,86 ± 1,21109,01 ± 2,457185,77 ± 6,81542,51 ± 0,76REDV original140,69 ± 2,15421,36 ± 8,165,98 ± 0,1115584,88 ± 33,861550,40 ± 3,46308,69 ± 3,48130,18 ± 2,387911,04 ± 15,01624,14 ± 1,24REDVD original99,22 ± 8,33374,45 ± 5,748,12 ± 0,188847,37 ± 76,841321,84 ± 2,62212,11 ± 1,1299,01 ± 1,226812,88 ± 21,66539,69 ± 4,18iREDVD21,45 ± 0,24286,85 ± 0,159,39 ± 0,116764,93 ± 27,111286,49 ± 0,96178,71 ± 1,5289,27 ± 0,696249,30 ± 25,21517,07 ± 0,59Improvement of iREDVD vs traditional.*Abs‐101,34‐95,372,11‐5650,62‐61,70‐53,15‐19,74‐932,27‐25,44%82,4624,9528,9845,514,5822,9218,1112,984,69Improvement of iREDVD vs REDV orig.Abs‐119,24‐134,533,41‐8829,95‐263,91‐128,98‐40,91‐1661,74‐107,07%84,7531,9357,0256,6217,0242,1131,4321,0117,15Improvement of iREDVD vs REDVD orig.Abs‐77,77‐87,601,27‐2082,44‐35,35‐33,39‐9,74‐563,58‐22,62Improvement %78,3823,3916,6423,542,6715,749,838,274,19*Improvement compared to the best traditional algorithm.Note: mean ± std of 10 testFor example, iREDVD achieves a reduction in average normalised waiting time of 34%–49% in Test 1 and 78%–82% in Test 2, compared to other control techniques, which is equivalent to reducing the average waiting time of vehicles at each intersection crossed by more than 6–100 seconds. This improvement is also reflected in other metrics, such as the reduction of average CO emissions by around 25% or a reduction in average fuel between 6%–17%.Final remarksThe excellent performance of the optimised iREDVD is due to the ability to adapt to road conditions. In Figure 10(a) and Figure 10(b), it can be observed how iREDVD proactively adapts the cycle time according to the simulated traffic flow rate, not only in training Figure 10(a), but also under unknown conditions in the testing Figure 10(b). The cycle increases when the flow rate of vehicles passing through the intersection increases, and vice versa. In addition, we can see in Figure 10(a) and Figure 10(b) that the time assigned to each branch is independent, since when the traffic is asymmetric [see Figure 10(b) hour 4] iREDVD obtains a time for each branch that is different, self‐adjusting to the traffic conditions. This independence allows iREDVD a better time distribution and a reduction of waiting time.10FIGUREVehicular flow rate simulated, time for each branch, and total cycle for intersection 1 vs. simulation time: (a) train scenario; (b) test scenarioCONCLUSIONWith the growing number of vehicles, the massification of large cities, and the rise of the IoT, there is a growing need for orchestration for smart cities. The use of ITS allows for efficient traffic control when applied to traffic‐light managed intersections, improving traffic flow, and diminishing pollution and fuel use. In this work, we have presented the optimisation and performance of the iREDVD ITS method using an EA as the optimisation tool. iREDVD is a lightweight, optimised, and fast algorithm capable of being executed in IoT devices with limited requirements, whereas controls signaled intersections in a very efficient way. We have shown that iREDVD outperforms traditional traffic light control techniques, as well as its different previous versions. Compared to traditional control techniques, it showed reductions between 24 to 100 seconds in the waiting time of vehicles per intersection travelled through, equivalent to between 50% and 80% reduction in the waiting time of vehicles at traffic lights. Compared to the original REDVD, iREDVD reduces the waiting time at each intersection by more than 27%. Besides, iREDVD reduces both the pollutant emissions and fuel consumption between 7%–38% and between 7%–14%, respectively.In conclusion, the use of these ITS will allow future smart cities to orchestrate the different actors better, allowing for increased safety, reduced pollution, and optimised transportation systems. As future work, research is being carried out on the integration of ITS with 5G technology, to obtain a fully connected traffic management ecosystem.ACKNOWLEDGEMENTSThis work was supported by the AEI/FEDER, UE project grant TEC2016‐76465‐C2‐1‐R (AIM), by DGT‐Ministerio del Interior, project grant SPIP2017‐02230 (STREET), and by a pre‐doctoral contract for the formation of the research personnel financed by Consejería de Empleo, Universidades, Empresa y Medio Ambiente of the CARM, through the Fundación Séneca ‐ Agencia de Ciencia y Tecnología of the Region of Murcia, Spain.REFERENCESZhang, J., et al.: Data‐driven intelligent transportation systems : A survey. IEEE Trans. Intell. Transp. Syst. 12 (4), 1624–1639 (2011)Murray, B.R.M., et al.: The Panel on Future Directions in Control, Dynamics, and Systems. IEEE Control Syst. Mag. 23 (2), 20–33 (2003)Cano, M.‐.D., et al.: Coordination and agreement among traffic signal controllers in urban areas. In: International Conference on Transparent Optical Networks, 2016.Singh, S.: Critical reasons for crashes investigated in the National Motor Vehicle Crash Causation Survey. Natl. Highw. Traffic Saf. Adm.1–2 (2015). Washington, DC. https://crashstats.nhtsa.dot.gov/Api/Public/ViewPublication/812115Pappis, C.P., Mamdani, E.H.: A fuzzy logic controller for a traffic junction. IEEE Trans. Syst. Man Cybern. 143(1–4), 73–97 (1977)Chiu, S., Chand, S.: Adaptive traffic signal control using fuzzy logic. In: 1st IEEE Regional Conference on Aerospace Control Systems, AEROCS 1993 ‐ Proceedings, 1993.Niittymäki, J., Pursula, M.: Signal control using fuzzy logic. Fuzzy Sets Syst., 16(1), 11–22 (2000)Dresner, K., Stone, P.: Multiagent traffic management: A reservation‐based intersection control mechanism. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2004.Vasirani, M., Ossowski, S.: A market‐inspired approach to reservation‐based urban road traffic management. In: Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, 2009.Li, Z., et al.: Modeling reservation‐based autonomous intersection control in VISSIM. Transp. Res. Rec. 2381 (1), 81–90 (2013)Wei, C.H.: Analysis of artificial neural network models for freeway ramp metering control. Artif. Intell. Eng. 15(3), 241–252 (2001)Srinivasan, D., Choy, M.C., Cheu, R.L.: Neural networks for real‐time traffic signal control. IEEE Trans. Intell. Transp. Syst. 7(3), 261–272 (2006)Tubaishat, M., Shang, Y., Shi, H.: Adaptive traffic light control with wireless sensor networks. In: 4th Annual IEEE Consumer Communications and Networking Conference, CCNC 2007.Mousavi, S.S., Schukat, M., Howley, E.: Traffic light control using deep policy‐gradient and value‐function‐based reinforcement learning. IET Intell. Transp. Syst. 11(7), 417–423 (2017)Genders, W., Razavi, S.: Evaluating reinforcement learning state representations for adaptive traffic signal control. Procedia Comput. Sci. 130, 26–33 (2018)Liang, X., et al.: A deep reinforcement learning network for traffic light cycle control. IEEE Trans. Veh. Technol., 68 (2), 1243–1253 (2019)Balaji, P.G., German, X., Srinivasan, D.: Urban traffic signal control using reinforcement learning agents. IET Intell. Transp. Syst. 4(3), 177–188 (2010)Arel, I., et al.: Reinforcement learning‐based multi‐agent system for network traffic signal control. IET Intell. Transp. Syst. 4(2), 128–135 (2010)Sánchez, J., Galán, M., Rubio, E.: Applying a traffic lights evolutionary optimization technique to a real case: ‘Las Ramblas’ area in Santa Cruz de Tenerife. IEEE Trans. Evol. Comput. 12(1), 25–40 (2008)Garcia‐Nieto, J., Olivera, A.C., Alba, E.: Optimal cycle program of traffic lights with particle swarm optimization. IEEE Trans. Evol. Comput. 17(6), 823–839 (2013)Segredo, E., et al.: Optimising Real‐World Traffic Cycle Programs by Using Evolutionary Computation. IEEE Access, 7, 43915–43932 (2019)McKenney, D., White, T.: Distributed and adaptive traffic signal control within a realistic traffic simulation. Eng. Appl. Artif. Intell. 26 (1), 574–583 (2013)Sanchez‐Iborra, R., Cano, M.‐.D.: On the similarities between urban traffic management and communication networks: Application of the random early detection algorithm for self‐regulating intersections. IEEE Intell. Transp. Syst. Mag. 9 (4), 48‐61 (2017)Sánchez‐Medina, J.J., Galán‐Moreno, M.J., Rubio‐Royo, E.: Traffic signal optimization in La Almozara District in Saragossa under congestion conditions, using genetic algorithms, traffic microsimulation, and cluster computing. IEEE Trans. Intell. Transp. Syst. 11(1), 132–141 (2010)Teo, K.T.K., Kow, W.Y., Chin, Y.K., Optimization of traffic flow within an urban traffic light intersection with genetic algorithm. In: Proceedings ‐ 2nd International Conference on Computational Intelligence, Modelling and Simulation, CIMSim 2010.Tang, C., et al.: Traffic signal phase scheduling based on device‐to‐device communication. IEEE Access 6, 47636–47645 (2018)Rehman, A., et al.: Vehicular traffic optimisation and even distribution using ant colony in smart city environment. IET Intell. Transp. Syst. 12 (7), 594–601 (2018).Jerry, K., et al.: NetLogo implementation of an ant colony optimisation solution to the traffic problem. IET Intell. Transp. Syst. 9(9), 862–869 (2015)Floyd, S., Jacobson, V.: Random early detection gateways for congestion avoidance. IEEE/ACM Trans. Netw. 1 (4), 397–413 (1993)Srinivas, M., Patnaik, L.M.: Genetic Algorithms: A Survey. Computer 27(6), 17–26 (1994)Roberge, V., Tarbouchi, M., Labonte, G.: Comparison of parallel genetic algorithm and particle swarm optimization for real‐time UAV path planning. IEEE Trans. Ind. Informatics 9 (1), 132–141 (2013)Chu, P.C., Beasley, J.E.: A genetic algorithm for the multidimensional knapsack problem. J. Heuristics 4(1), (1998)Viveros‐Jiménez, F., León‐Borges, J.A., Cruz‐Cortés, N.: An adaptive single‐point algorithm for global numerical optimization. Expert Syst. Appl. 41 (3), 877–885 (2014)Toǧan, V., Daloǧlu, A.T.: An improved genetic algorithm with initial population strategy and self‐adaptive member grouping. Comput. Struct. 86 (11–12), 1204‐1218 (2008)Johnson, J.M., Rahmat‐Samii, Y.: Genetic algorithms in engineering electromagnetics. IEEE Antennas Propag. Mag. 39 (4), 7–21 (1997)Venkatesh, P., Gnanadass, R., Padhy, N.P.: Comparison and application of evolutionary programming techniques to combined economic emission dispatch with line flow constraints. IEEE Trans. Power Syst. 18 (2), 688–697 (2003)Dip, G., Prahlad, V., Kien, P.D.: Genetic algorithm‐based optimal bipedal walking gait synthesis considering tradeoff between stability margin and speed. Robotica, 27(3), 355–365 (2009)Goldberg, D.E., Deb, K.: A comparative analysis of selection schemes used in genetic algorithms. 1, 69–93 (1991)Blickle, T., Thiele, L.: A comparison of selection schemes used in evolutionary algorithms. Evol. Comput. 4 (4), 361–394 (1996)Jebari, K., Madiafi, M.: Selection methods for genetic algorithms. Int. J. Emerg. Sci. 3, 333–344 (2013)Reeves, C.R., Rowe, J.E., Genetic Algorithms—Principles and Perspectives. Kluwer, Boston (2002)Kora, P., Yadlapalli, P.: Crossover operators in genetic algorithms: A review. Int. J. Comput. Appl. 6 (1), (2017)Magalhães‐Mendes, J.: A comparative study of crossover operators for genetic algorithms to solve the job shop scheduling problem. WSEAS Trans. Comput. 12 (4), 164–173 (2013)Spears, W.M., Anand, V.: A study of crossover operators in genetic programming. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer, Berlin (1991)Abdoun, O., Abouchabaka, J.: A comparative study of adaptive crossover operators for genetic algorithms to resolve the traveling salesman problem. 31 (11), 975–8887 (2012)Srinivas, M., Patnaik, L.M.: Adaptive probabilities of crossover and mutation in genetic algorithms. IEEE Trans. Syst. Man Cybern. 24 (4), 656–667 (1994)Back, T.: Optimal mutation rates in genetic search. In: Proceedings of the fifth international conference on genetic algorithms, 1993, 2–8.Hesser, J., Mäinner, R.: Towards an optimal mutation probability genetic algorithms. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer, Berlin (1991)Greenwell, R.N., Angus, J.E., Finck, M.: Optimal mutation probability for genetic algorithms. Math. Comput. Model. 21(8), 1–11 (1995)Lopez, P.A., et al.: Microscopic Traffic Simulation using SUMO. In: 21st International Conference on Intelligent Transportation Systems (ITSC), 2018, 2575–2582.“Madrid's Vehicle Fleet. 2018. [Online]. Available: bit.ly/2svNegb. Accessed: 28 Apr. 2019.Borge, R., et al.: Comparison of road traffic emission models in Madrid (Spain). Atmos. Environ. 62, 461–471, (2012)Yu, C., et al.: Integrated optimization of traffic signals and vehicle trajectories at isolated urban intersections. Transp. Res. Part B Methodol. 112, 89–112 (2018)Dresner, K., Stone, P.: Multiagent traffic management: An improved intersection control mechanism. In: Proceedings of the International Conference on Autonomous Agents, (2005).Salman, M.A., Ozdemir, S., Celebi, F.V.: Fuzzy traffic control with vehicle‐to‐everything communication. Sensors (Switzerland), 18 (2), Art. no. 368 (2018)Almeida, A.M.R., Macedo, J.A.F., Machado, J.C., Optimization of urban semaphore times turning into JSSP. In: CEUR Workshop Proceedings, 2018.Mousavi, S.S., Schukat, M., Howley, E.: Traffic light control using deep policy‐gradient and value‐function‐based reinforcement learning. IET Intel. Transport Syst. 11(7), 417–423 (2017)

Journal

IET Intelligent Transport SystemsWiley

Published: Feb 1, 2021

Keywords: Computer communications; Mobile radio systems; Optimisation techniques; Traffic engineering computing; Vehicle mechanics; Optimisation techniques; Other topics in statistics; Internet software

There are no references for this article.