Loading [MathJax]/extensions/MathMenu.js
Automatic Generation of Frontend Code from Design Interface | IEEE Conference Publication | IEEE Xplore

Automatic Generation of Frontend Code from Design Interface


Abstract:

Internet technology is constantly evolving, and application software plays a crucial role in various fields. The development of various software relies heavily on the des...Show More

Abstract:

Internet technology is constantly evolving, and application software plays a crucial role in various fields. The development of various software relies heavily on the design of user interfaces and the corresponding code implementation. However, implementing front-end code is a highly repetitive and tedious task. Prior to designing the front-end interface based on the requirements document, designers usually create a simpler interface to assist in designing and implementing the actual interface. The specific focus of this study is to generate a domain-specific language (DSL) file based on a screenshot of the front-end interface using deep learning. The pix2code project was the first to propose using deep learning for DSL text generation. However, the pix2code project has several issues: firstly, the DSL language designed in the project lacks an adequate description vocabulary to meet the basic requirements of describing actual web pages; secondly, the project has poor scalability for the DSL language; and finally, the project's generation effectiveness using deep learning models is average. This study expands on the pix2code project by extending the design of the domain-specific language, introducing new descriptive vocabulary, and developing an algorithm to generate a new dataset. These enhancements allow the DSL to better describe various components and their combinations within the interface. Additionally, three improvements are made to the original deep learning model. Firstly, the convolutional network used for image feature extraction is deepened to enhance feature extraction capabilities. Secondly, a text encoder based on Transformer is designed for text feature extraction. Finally, a bidirectional LSTM network is used for decoding. Experimental results show that the proposed model in this study improves the BLEU score from 0.81 to 0.85 on a publicly available dataset, and from 0.547 to 0.575 on a newly created dataset.
Date of Conference: 08-14 December 2023
Date Added to IEEE Xplore: 29 December 2023
ISBN Information:

ISSN Information:

Conference Location: Cairo, Egypt

Funding Agency:

References is not available for this document.

I. Introduction

The software development process mainly includes requirements analysis, software design, code implementation, and project deployment. Software design encompasses an important aspect known as user interface (UI) design. The focus of this study is to utilize deep learning methods to assist frontend developers in more effectively and conveniently implement code for user interfaces.

Select All
1.
T. Beltramelli, "pix2code: Generating code from a graphical user interface screenshot", Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems, pp. 1-6, 2018.
2.
X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W.-K. Wong and W.-c. Woo, "Convolutional lstm network: A machine learning approach for precipitation nowcasting", Advances in Neural Information Processing Systems, vol. 28, 2015.
3.
Y. Liu, Q. Hu and K. Shu, "Improving pix2code based bi-directional lstm", 2018 IEEE International Conference on Automation Electronics and Electrical Engineering (AUTEEE), pp. 220-223, 2018.
4.
X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks", Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249-256, 2010.
5.
Y.-S. Yun, J. Jung, S. Eun, S.-S. So and J. Heo, "Detection of gui elements on sketch images using object detector based on deep neural networks", Proceedings of the Sixth International Conference on Green and Human Information Technology: ICGHIT 2018, pp. 86-90, 2019.
6.
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., "Attention is all you need", Advances in Neural Information Processing Systems, vol. 30, 2017.
7.
M. Mernik, J. Heering and A. M. Sloane, "When and how to develop domain-specific languages", ACM Computing Surveys (CSUR), vol. 37, no. 4, pp. 316-344, 2005.
8.
D. M. Chickering, "Optimal structure identification with greedy search", Journal of Machine Learning Research, vol. 3, pp. 507-554, Nov 2002.
9.
S. Wiseman and A. M. Rush, "Sequence-to-sequence learning as beam-search optimization", arXiv preprint, 2016.
10.
B. Zhou, A. Khosla, A. Lapedriza, A. Oliva and A. Torralba, "Learning deep features for discriminative localization", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2921-2929, 2016.
11.
R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh and D. Batra, "Grad-cam: Visual explanations from deep networks via gradient-based localization", Proceedings of the IEEE International Conference on Comnuter Vision, pp. 618-626, 2017.
12.
J. T. Springenberg, A. Dosovitskiy, T. Brox and M. Riedmiller, "Striving for simplicity: The all convolutional net", International Conference on Learning Representations, May 2015.
13.
M. D. Zeiler and R. Fergus, "Visualizing and understanding convolutional networks", Computer Vision-ECCV 2014: 13th European Conference, pp. 818-833, 2014.
14.
K. He, X. Zhang, S. Ren and J. Sun, "Deep residual learning for image recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
15.
Z. Liu, Y. Lin, Y. Cao, H. Hu, Y. Wei, Z. Zhang, et al., "Swin transformer: Hierarchical vision transformer using shifted windows", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012-10022, 2021.
16.
K. Papineni, S. Roukos, T. Ward and W.-J. Zhu, "Bleu: a method for automatic evaluation of machine translation", Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311-318, 2002.
17.
C.-Y. Lin, "Rouge: A package for automatic evaluation of summaries", Text Summarization Branches Out, pp. 74-81, 2004.
18.
S. Banerjee and A. Lavie, "Meteor: An automatic metric for mt evaluation with improved correlation with human judgments", Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65-72, 2005.
19.
S. Ruder, "An overview of gradient descent optimization algorithms", arXiv preprint, 2016.
20.
J. Deng, J. Guo, N. Xue and S. Zafeiriou, "Arcface: Additive angular margin loss for deep face recognition", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4690-4699, 2019.

Contact IEEE to Subscribe

References

References is not available for this document.