Investigasi pengaruh Step Training pada Skema Same-Padding untuk Metode Faster R-CNN dalam Teknologi Augmented Reality
DOI:
https://doi.org/10.22441//fifo.2020.v12i2.002Keywords:
augmented reality, convolutional neural network, faster region-based convolutional neural network, region proposal network, ResNetAbstract
Abstract
Augmented Reality (AR) is a technology with the concept of combining real-world dimensions with virtual world dimensions that are displayed in realtime. In the AR environment, interaction techniques used can vary. Marker-based AR is one type of AR that allows virtual objects to be displayed in the real world by using markers as pointers. In the use of marker-based AR required object detection method used for tracking markers. In this study, a system that can detect objects in the form of fingertips will be designed. In designing the system the Faster Region-based Convolutional Neural Network (Faster R-CNN) method is used. R-CNN Faster is an object detection method which is a combination of the Fast R-CNN method and the Region Proposal Network (RPN). The results of the detection parameters will be used for tracking, namely the coordinates x, y, width, and length. This research uses the Faster R-CNN method because it has a faster computing speed compared to the previous method, namely Particle Filter. The Faster R-CNN method uses ResNet architecture as the core of CNN. The system configuration to be tested is the 25K, 50K and 75K step training with the same-padding scheme. The testing process is taken from a video consisting of 10800 training data and 3600 test data. The best system configuration based on parameter priority for AR technology is obtained in the 50K step training.
Keyword: augmented reality, convolutional neural network, faster region-based convolutional neural network, region proposal network, ResNet.
Abstrak
Augmented Reality (AR) adalah teknologi dengan konsep menggabungkan dimensi dunia nyata dengan dimensi dunia virtual yang ditampilkan secara real-time. Dalam lingkungan AR, teknik interaksi yang digunakan dapat bermacam – macam. Marker-based AR merupakan salah satu jenis AR yang memungkinkan objek virtual ditampilkan ke dalam dunia nyata dengan digunakannya marker sebagai pointer-nya. Dalam penggunaan AR berbasis marker diperlukan metode deteksi objek yang digunakan untuk tracking marker. Dalam penelitian ini akan dirancang sebuah sistem yang dapat mendeteksi objek berupa ujung jari. Dalam perancangan sistem tersebut digunakan metode Faster Region-Based Convolutional Nueral Network (Faster R-CNN). Faster R-CNN merupakan salah satu metode deteksi objek yang merupakan gabungan dari metode Fast R-CNN dan Region Proposal Network (RPN). Hasil dari parameter deteksi akan digunakan untuk tracking, yaitu koordinat x, y, width, dan length. Penelitian ini menggunakan metode Faster R-CNN karena memiliki kecepatan komputasi yang lebih cepat dibandingkan dengan metode sebelumnya yaitu Particle Filter. Metode Faster R-CNN mengunakan arsitektur ResNet sebagai inti dari CNN. Konfigurasi sistem yang akan diuji adalah step training 25K, 50K dan 75K dengan skema same-padding. Proses pengujian diambil dari video yang terdiri dari 10800 data latih dan 3600 data uji. Konfigurasi sistem terbaik berdasarkan prioritas parameter untuk teknologi AR didapatkan pada step training 50K.
Keyword: augmented reality, convolutional neural network, faster region-based convolutional neural network, region proposal network, ResNet.Downloads
References
S. Malik, C. McDonald, and G. Roth. 2002. Hand tracking for interactive pattern-based augmented reality. in Proceedings of the 1st International Symposiumon Mixed and Augmented Reality. IEEE Computer Society.
T. Lee and T. Hollerer. 2007. Handy ar: Markerless inspection of augmented reality objects using fingertip tracking. in 2007 11th IEEE International Symposiumon Wearable Computers. IEEE.
S. S. Rao. 2010. Sixth sense technology. in 2010 International Conference on Communication and Computational Intelligence (INCOCCI). IEEE.
S. Ren, K. He, R. Girshick, and J. Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems.
R. Girshick, J. Donahue, T. Darrell, and J. Malik. 2014. Rich feature hierarchiesfor accurate object detection and semantic segmentation. in Proceedings of the IEEE conference on computer vision and pattern recognition.
F-f. Li, J. Johnson, and S. Yeung. 2018. Lecture 5 Convolutional Neural Networks. pp. 1–78. [Online]. Available: http://cs231n.stanford.edu/slides/2018/cs231n2018lecture05.pdf
R. Girshick. 2015. Fast r-cnn. in Proceedings of the IEEE international conferenceon computer vision.
K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep residual learning for image recog-nition. in Proceedings of the IEEE conference on computer vision and patternrecognition.
S. J. Pan and Q. Yang. 2009. A survey on transfer learning. IEEE Transactions on knowledge and data engineering.
S. A. Wibowo, H. Lee, E. K. Kim, and S. Kim. 2017. Convolutional shallow features for performance improvement of histogram of oriented gradients in visual object tracking. Mathematical Problems in Engineering.
S. A. Wibowo, H. Lee, E. K. Kim, and S. Kim. 2018. Collaborative learning based on convolutional features and correlation filter for visual tracking. International Journal of Control, Automation and Systems.
S. A. Wibowo, H. Lee, E. K. Kim, and S. Kim. 2017. Visual tracking based on complementary learners with distractor handling. Mathematical Problems in Engineering.
A. Wiranata, S. A. Wibowo, R. Patmasari, R. Rahmania, and R. Mayasari. 2018. Investigation of padding schemes for faster r-cnn on vehi-cle detection. International Conference on Control, Electronics, Renewable Energy and Communications (ICCEREC). IEEE.
A. Y. Nasirudin, S .A. Wibowo, and R. Purnamasari. 2018. Performance analysis on fine-tuned region-based cnn for object recognition. Symposium of Future Telecommunication and Technologies (SOFTT).
Downloads
Published
How to Cite
Issue
Section
License
The copyright to this article is transferred to Universitas Mercu Buana (UMB) if and when the article is accepted for publication. The undersigned hereby transfers any and all rights in and to the paper including without limitation all copyrights to UMB. The undersigned hereby represents and warrants that the paper is original and that he/she is the author of the paper, except for material that is clearly identified as to its original source, with permission notices from the copyright owners where required. The undersigned represents that he/she has the power and authority to make and execute this assignment.
We declare that this paper has not been published in the same form elsewhere.
Furthermore, I/We hereby transfer the unlimited rights of publication of the above-mentioned paper as a whole to UMB. The copyright transfer covers the right to reproduce and distribute the article, including reprints, translations, photographic reproductions, microform, electronic form (offline, online) or any other reproductions of similar nature.
The corresponding author signs for and accepts responsibility for releasing this material on behalf of any and all co-authors. This agreement is to be signed by at least one of the authors who have obtained the assent of the co-author(s) where applicable. After submission of this agreement signed by the corresponding author, changes of authorship or in the order of the authors listed will not be accepted.
Retained Rights/Terms and Conditions
Although authors are permitted to re-use all or portions of the Work in other works, this does not include granting third-party requests for reprinting, republishing, or other types of re-use.
Our Articles are licensed under CC BY-NC

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.









