As expected, the YOLO4D models outperform the frame stacking models. Frame stacking encodes temporal information only through the reshaping of inputs, while YOLO4D is a deep learning approach that uses 4D tensors to incorporate temporal information in 3D object detection from LiDAR point clouds. It extends YOLO v2 with Convolutional LSTM.
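
To make the contrast concrete, here is a minimal sketch of the two ways of handling a short clip of frames. It is not the YOLO4D implementation: the toy shapes, the function names (`make_clip`, `stack_frames`, `recurrent_summary`), and the moving-average update are all assumptions chosen for illustration. The moving average only stands in for a recurrent layer, which in a Convolutional LSTM would use learned convolutional gates.

```python
# Toy shapes; all names and sizes here are hypothetical, not from YOLO4D.
T, C, H, W = 4, 2, 3, 3  # frames, channels per frame, grid height, grid width


def make_clip(t_steps, channels, height, width):
    """Build a clip indexed as clip[t][c][y][x]; each cell stores its frame index t."""
    return [
        [[[float(t)] * width for _ in range(height)] for _ in range(channels)]
        for t in range(t_steps)
    ]


def stack_frames(clip):
    """Frame stacking: fold the time axis into the channel axis.

    (T, C, H, W) -> (T*C, H, W). Time survives only as channel order.
    """
    return [channel for frame in clip for channel in frame]


def recurrent_summary(clip, keep=0.5):
    """Stand-in for a recurrent layer: visit frames in order, carrying a state.

    The state is an exponential moving average with shape (C, H, W). A real
    Convolutional LSTM would use learned convolutional gates instead.
    """
    state = [[[0.0] * len(row) for row in channel] for channel in clip[0]]
    for frame in clip:
        state = [
            [
                [keep * s + (1.0 - keep) * x for s, x in zip(s_row, x_row)]
                for s_row, x_row in zip(s_ch, x_ch)
            ]
            for s_ch, x_ch in zip(state, frame)
        ]
    return state


clip = make_clip(T, C, H, W)
stacked = stack_frames(clip)
summary = recurrent_summary(clip)

print(len(stacked), len(stacked[0]), len(stacked[0][0]))  # T*C, H, W
print(len(summary), len(summary[0]), len(summary[0][0]))  # C, H, W
```

In the stacked version, a downstream 2D network sees only a wider channel dimension and has to infer frame order from channel position. In the recurrent version, the time axis stays explicit and each frame updates a state that is carried forward to the next one.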