Conferences >2023 13th International Confe...

Automatic Generation of Frontend Code from Design Interface

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Internet technology is constantly evolving, and application software plays a crucial role in various fields. The development of various software relies heavily on the des...Show More

Metadata

Abstract:

Internet technology is constantly evolving, and application software plays a crucial role in various fields. The development of various software relies heavily on the design of user interfaces and the corresponding code implementation. However, implementing front-end code is a highly repetitive and tedious task. Prior to designing the front-end interface based on the requirements document, designers usually create a simpler interface to assist in designing and implementing the actual interface. The specific focus of this study is to generate a domain-specific language (DSL) file based on a screenshot of the front-end interface using deep learning. The pix2code project was the first to propose using deep learning for DSL text generation. However, the pix2code project has several issues: firstly, the DSL language designed in the project lacks an adequate description vocabulary to meet the basic requirements of describing actual web pages; secondly, the project has poor scalability for the DSL language; and finally, the project's generation effectiveness using deep learning models is average. This study expands on the pix2code project by extending the design of the domain-specific language, introducing new descriptive vocabulary, and developing an algorithm to generate a new dataset. These enhancements allow the DSL to better describe various components and their combinations within the interface. Additionally, three improvements are made to the original deep learning model. Firstly, the convolutional network used for image feature extraction is deepened to enhance feature extraction capabilities. Secondly, a text encoder based on Transformer is designed for text feature extraction. Finally, a bidirectional LSTM network is used for decoding. Experimental results show that the proposed model in this study improves the BLEU score from 0.81 to 0.85 on a publicly available dataset, and from 0.547 to 0.575 on a newly created dataset.

Published in: 2023 13th International Conference on Information Science and Technology (ICIST)

Date of Conference: 08-14 December 2023

Date Added to IEEE Xplore: 29 December 2023

ISBN Information:

ISSN Information:

DOI: 10.1109/ICIST59754.2023.10367169

Conference Location: Cairo, Egypt

Funding Agency:

References is not available for this document.

Contents

I. Introduction

The software development process mainly includes requirements analysis, software design, code implementation, and project deployment. Software design encompasses an important aspect known as user interface (UI) design. The focus of this study is to utilize deep learning methods to assist frontend developers in more effectively and conveniently implement code for user interfaces.

Select All

T. Beltramelli, "pix2code: Generating code from a graphical user interface screenshot", Proceedings of the ACM SIGCHI Symposium on Engineering Interactive Computing Systems, pp. 1-6, 2018.

CrossRef Google Scholar

X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W.-K. Wong and W.-c. Woo, "Convolutional lstm network: A machine learning approach for precipitation nowcasting", Advances in Neural Information Processing Systems, vol. 28, 2015.

Google Scholar

Y. Liu, Q. Hu and K. Shu, "Improving pix2code based bi-directional lstm", 2018 IEEE International Conference on Automation Electronics and Electrical Engineering (AUTEEE), pp. 220-223, 2018.

View Article

Google Scholar

X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks", Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249-256, 2010.

Google Scholar

Y.-S. Yun, J. Jung, S. Eun, S.-S. So and J. Heo, "Detection of gui elements on sketch images using object detector based on deep neural networks", Proceedings of the Sixth International Conference on Green and Human Information Technology: ICGHIT 2018, pp. 86-90, 2019.

CrossRef Google Scholar

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., "Attention is all you need", Advances in Neural Information Processing Systems, vol. 30, 2017.

Google Scholar

M. Mernik, J. Heering and A. M. Sloane, "When and how to develop domain-specific languages", ACM Computing Surveys (CSUR), vol. 37, no. 4, pp. 316-344, 2005.

CrossRef Google Scholar

D. M. Chickering, "Optimal structure identification with greedy search", Journal of Machine Learning Research, vol. 3, pp. 507-554, Nov 2002.

CrossRef Google Scholar

S. Wiseman and A. M. Rush, "Sequence-to-sequence learning as beam-search optimization", arXiv preprint, 2016.

CrossRef Google Scholar

10.

B. Zhou, A. Khosla, A. Lapedriza, A. Oliva and A. Torralba, "Learning deep features for discriminative localization", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2921-2929, 2016.

View Article

Google Scholar

11.

R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh and D. Batra, "Grad-cam: Visual explanations from deep networks via gradient-based localization", Proceedings of the IEEE International Conference on Comnuter Vision, pp. 618-626, 2017.

View Article

Google Scholar

12.

J. T. Springenberg, A. Dosovitskiy, T. Brox and M. Riedmiller, "Striving for simplicity: The all convolutional net", International Conference on Learning Representations, May 2015.

Google Scholar

13.

M. D. Zeiler and R. Fergus, "Visualizing and understanding convolutional networks", Computer Vision-ECCV 2014: 13th European Conference, pp. 818-833, 2014.

CrossRef Google Scholar

14.

K. He, X. Zhang, S. Ren and J. Sun, "Deep residual learning for image recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.

View Article

Google Scholar

15.

Z. Liu, Y. Lin, Y. Cao, H. Hu, Y. Wei, Z. Zhang, et al., "Swin transformer: Hierarchical vision transformer using shifted windows", Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012-10022, 2021.

View Article

Google Scholar

16.

K. Papineni, S. Roukos, T. Ward and W.-J. Zhu, "Bleu: a method for automatic evaluation of machine translation", Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311-318, 2002.

CrossRef Google Scholar

17.

C.-Y. Lin, "Rouge: A package for automatic evaluation of summaries", Text Summarization Branches Out, pp. 74-81, 2004.

Google Scholar

18.

S. Banerjee and A. Lavie, "Meteor: An automatic metric for mt evaluation with improved correlation with human judgments", Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65-72, 2005.

Google Scholar

19.

S. Ruder, "An overview of gradient descent optimization algorithms", arXiv preprint, 2016.

Google Scholar

20.

J. Deng, J. Guo, N. Xue and S. Zafeiriou, "Arcface: Additive angular margin loss for deep face recognition", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4690-4699, 2019.

View Article

Google Scholar

References is not available for this document.

Automatic Generation of Frontend Code from Design Interface

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Automatic Generation of Frontend Code from Design Interface

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

Authors

Figures

References

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?