S&M 4183, Technical Paper of Special Issue
https://doi.org/10.18494/SAM5937
Published: September 30, 2025

Employee Work Behavior Monitoring Using Multimodal Large Language Models

Yushi Chen, Chung-Hsing Chao, Linjing Liu, and Cheng-Fu Yang
(Received September 16, 2025; Accepted September 24, 2025)
pp. 4309-4321

Keywords: multimodal large language models, employee behavior monitoring, smart office, prompt engineering, privacy protection
With the rapid advancement of artificial intelligence, enterprises increasingly demand efficient and flexible solutions for monitoring employee work behavior in office environments. Traditional systems often suffer from high costs, rigidity, and reliance on extensive labeled data. Multimodal large language models (MLLMs), which can integrate information from text, images, and audio, offer a novel zero-shot inference approach that reduces both data dependence and deployment complexity. In this study, we present a practical application framework that combines seating area definition, image cropping, and prompt engineering to analyze employee behaviors such as focused screen engagement and nonwork-related interactions. Results are output in a standardized JavaScript Object Notation (JSON) format, facilitating aggregation and actionable insights for human resource management. Additionally, critical privacy, ethical, and legal considerations are discussed, along with mitigation strategies to support responsible deployment. Through practical simulation scenarios and cost–benefit analysis, we demonstrate that MLLMs enable scalable and economically viable employee behavior monitoring solutions suitable for small and medium-sized enterprises.
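To make the pipeline concrete, the following is a minimal Python sketch of the three steps named in the abstract (seating area definition, image cropping, and prompt engineering with JSON output). The model name, seat coordinates, prompt wording, file name, and output schema are illustrative assumptions for exposition, not the authors' actual configuration; any vision-capable MLLM endpoint could be substituted.

```python
# Hypothetical sketch of the crop-and-prompt monitoring pipeline; all
# identifiers below (seat regions, prompt, model, schema) are assumptions.
import base64
import io
import json

from openai import OpenAI   # pip install openai
from PIL import Image       # pip install pillow

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Step 1: seating area definition -- fixed per-seat pixel regions
# (left, top, right, bottom), assumed calibrated once per camera.
SEAT_REGIONS = {
    "seat_01": (0, 0, 640, 480),
    "seat_02": (640, 0, 1280, 480),
}

PROMPT = (
    "You are monitoring an office workstation. Classify the person's "
    "behavior as one of: focused_screen_engagement, nonwork_interaction, "
    "absent. Respond with JSON: "
    '{"seat_id": string, "behavior": string, "confidence": number}.'
)

def classify_seat(frame: Image.Image, seat_id: str) -> dict:
    """Crop one seating region and request a zero-shot label from the MLLM."""
    # Step 2: image cropping -- isolate the seat so the model sees one person.
    crop = frame.crop(SEAT_REGIONS[seat_id])
    buf = io.BytesIO()
    crop.save(buf, format="JPEG")
    b64 = base64.b64encode(buf.getvalue()).decode()

    # Step 3: prompt engineering -- zero-shot instruction plus the cropped image.
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder; any vision-capable MLLM would do
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": f"{PROMPT} seat_id={seat_id}"},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
        response_format={"type": "json_object"},  # constrain output to JSON
    )
    # Standardized JSON output, ready for aggregation across seats and frames.
    return json.loads(response.choices[0].message.content)

if __name__ == "__main__":
    frame = Image.open("office_frame.jpg")  # one camera snapshot (assumed path)
    print(classify_seat(frame, "seat_01"))
```

Because each seat returns the same JSON schema, per-seat results can be aggregated over time without any task-specific training data, which is the zero-shot advantage the abstract claims over traditional labeled-data systems.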
Corresponding authors: Linjing Liu and Cheng-Fu Yang

This work is licensed under a Creative Commons Attribution 4.0 International License.

Cite this article: Yushi Chen, Chung-Hsing Chao, Linjing Liu, and Cheng-Fu Yang, Employee Work Behavior Monitoring Using Multimodal Large Language Models, Sens. Mater., Vol. 37, No. 9, 2025, pp. 4309-4321.