Vacant Parking Space Detection based on TaskConsistency and Reinforcement Learning


Abstract

  In this paper, we proposed a novel task-consistency learning method that allows training a vacant space detection network (target task) based on the logistic consistency with the semantic outcomes from a naive flow-based motion behavior classifier (source task) in a parking lot. By well designing the reward mechanism upon semantic consistency, we show the possibility to train the target network in a reinforcement learning setting. Compared with conventional supervised detection methods, the major contribution of this work is to learn a vacant space detector via semantic consistency rather than supervised labels. The dynamic learning property may make the proposed detector been deployed in different lots easily without heavy training loads. The experiments show that based on the task consistency rewards from the motion behavior classifier, the vacant space detector can be trained successfully.


Experiment

  Several demos had been prepared to compare our RL based model with a supervised learning-based model. The first scenario is a light changing scenario. Other demos present performance under different car density levels. The sparse scenario means only a few cars in a parking lot, the for less occlusion pattern could be observed. The dense scenario means the parking lot is nearly full and all the car is affected by occlusion patterns.

Light Changing Scenario

Light Changing Scenario-Supervised learning with 34688 samples

Light Changing Scenario-RL learning with 824 training trajectories

Sparse Scenario

Sparse Scenario-Supervised learning with 34688 samples

Sparse Scenario-RL learning with 824 training trajectories

Moderate Scenario

Moderate Scenario-Supervised learning with 34688 samples

Moderate Scenario-RL learning with 824 training trajectories

Dense Scenario

Dense Scenario-Supervised learning with 34688 samples

Dense Scenario-RL learning with 824 training trajectories