W. Benjamin
Results for "Fine Arts"
Showing 20 of ~2551048 results · from CrossRef, DOAJ, Semantic Scholar
Chunyuan Li, Jianwei Yang, Pengchuan Zhang et al.
This paper investigates two techniques for developing efficient self-supervised vision transformers (EsViT) for visual representation learning. First, we show through a comprehensive empirical study that multi-stage architectures with sparse self-attentions can significantly reduce modeling complexity, but at the cost of losing the ability to capture fine-grained correspondences between image regions. Second, we propose a new pre-training task of region matching, which allows the model to capture fine-grained region dependencies and as a result significantly improves the quality of the learned vision representations. Our results show that, combining the two techniques, EsViT achieves 81.3% top-1 accuracy on the ImageNet linear probe evaluation, outperforming prior art with around an order of magnitude higher throughput. When transferring to downstream linear classification tasks, EsViT outperforms its supervised counterpart on 17 out of 18 datasets. The code and models are publicly available: https://github.com/microsoft/esvit
J. Kopf, Xuejian Rong, Jia-Bin Huang
We present an algorithm for estimating consistent dense depth maps and camera poses from a monocular video. We integrate a learning-based depth prior, in the form of a convolutional neural network trained for single-image depth estimation, with geometric optimization, to estimate a smooth camera trajectory as well as detailed and stable depth reconstruction. Our algorithm combines two complementary techniques: (1) flexible deformation-splines for low-frequency large-scale alignment and (2) geometry-aware depth filtering for high-frequency alignment of fine depth details. In contrast to prior approaches, our method does not require camera poses as input and achieves robust reconstruction for challenging hand-held cell phone captures containing a significant amount of noise, shake, motion blur, and rolling shutter deformations. Our method quantitatively outperforms the state of the art on the Sintel benchmark for both depth and pose estimation and attains favorable qualitative results across diverse wild datasets.
Obioma Pelka, Sven Koitka, Johannes Rückert et al.
Mohammad Mehedi Hassan, Md. Golam Rabiul Alam, Md. Zia Uddin et al.
Recently, deep learning methodologies have become popular for analysing physiological signals in multiple modalities via hierarchical architectures for human emotion recognition. Most state-of-the-art approaches to human emotion recognition use deep learning for emotion classification. However, deep learning is mostly effective for deep feature extraction. Therefore, in this research, we applied an unsupervised deep belief network (DBN) for depth-level feature extraction from fused observations of Electro-Dermal Activity (EDA), Photoplethysmogram (PPG) and Zygomaticus Electromyography (zEMG) sensor signals. Afterwards, the DBN-produced features are combined with statistical features of EDA, PPG and zEMG to prepare a feature-fusion vector. The prepared feature vector is then used to classify five basic emotions, namely Happy, Relaxed, Disgust, Sad and Neutral. As the emotion classes are not linearly separable from the feature-fusion vector, the Fine Gaussian Support Vector Machine (FGSVM) is used with a radial basis function kernel for non-linear classification of human emotions. Our experiments on a public multimodal physiological signal dataset show that the DBN- and FGSVM-based model significantly increases emotion recognition accuracy compared to the existing state-of-the-art emotion classification techniques.
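The feature-fusion and kernel steps described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, the fusion is assumed to be plain concatenation, and the "fine" kernel scale is assumed to follow the MATLAB Classification Learner preset of sqrt(P)/4 for P features, which the abstract does not confirm.

```python
import math


def fuse_features(dbn_features, statistical_features):
    """Concatenate learned (DBN) and hand-crafted statistical features.

    Assumption: fusion is plain concatenation; the abstract does not say.
    """
    return list(dbn_features) + list(statistical_features)


def fine_gaussian_scale(num_features):
    """Kernel scale for a 'fine' Gaussian SVM.

    Assumption: the MATLAB Classification Learner preset sqrt(P) / 4.
    """
    return math.sqrt(num_features) / 4.0


def rbf_kernel(x, y, scale):
    """Gaussian (RBF) kernel: exp(-||x - y||^2 / scale^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / scale ** 2)


# Two toy fusion vectors (values are made up for illustration only).
a = fuse_features([0.1, 0.2], [1.0, 2.0])
b = fuse_features([0.1, 0.3], [1.0, 2.5])
similarity = rbf_kernel(a, b, fine_gaussian_scale(len(a)))
```

A small kernel scale makes the similarity fall off quickly with distance, which is what allows a finely detailed, non-linear decision boundary between emotion classes.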
A. Pręgowska, K. Masztalerz, M. Garlińska et al.
Surprisingly, distance education is quite an old concept. Its origins date back to the first correspondence-based course, which took place via the postal service in Boston, USA, in the 18th century. Rapid technological developments, especially in video and audio streaming, have increased the availability of such courses and moved learning into the virtual world. Due to the ongoing COVID-19 pandemic, we are witnessing an accelerated revolution in the learning process, as nearly all forms of education have been shifted online. Will this have a destructive effect on the human psyche? Is humanity sufficiently aware and ready for such a dramatic change? Will we return to physical in-classroom studies, or is remote distance education set to become the new norm? In particular, in medicine, computer science, fine arts, or architectural design, such a rapid change in the way students learn can be quite challenging. In this paper, we provide an overview of the history of distance learning, taking into account teachers’ and students’ points of view in both secondary and higher education.
Yongqing Liang, Xin Li, N. Jafari et al.
We propose a new matching-based framework for semi-supervised video object segmentation (VOS). Recently, state-of-the-art VOS performance has been achieved by matching-based algorithms, in which feature banks are created to store features for region matching and classification. However, how to effectively organize information in the continuously growing feature bank remains under-explored, and this leads to inefficient design of the bank. We introduce an adaptive feature bank update scheme to dynamically absorb new features and discard obsolete features. We also design a new confidence loss and a fine-grained segmentation module to enhance the segmentation accuracy in uncertain regions. On public benchmarks, our algorithm outperforms the existing state of the art.
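The idea of absorbing new features and discarding obsolete ones can be sketched as follows. This is a toy illustration under stated assumptions, not the paper's scheme: the class name, the cosine-similarity merge rule, the averaging update, and the least-used eviction policy are all hypothetical choices.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class FeatureBank:
    """Fixed-capacity bank (hypothetical design, not the paper's)."""

    def __init__(self, capacity, merge_threshold=0.95):
        self.capacity = capacity
        self.merge_threshold = merge_threshold
        self.entries = []  # each entry: {"feature": [...], "hits": int}

    def absorb(self, feature):
        # Absorb: merge into the most similar stored feature if close enough.
        best = max(self.entries,
                   key=lambda e: cosine(e["feature"], feature),
                   default=None)
        if best is not None and cosine(best["feature"], feature) >= self.merge_threshold:
            best["feature"] = [(old + new) / 2
                               for old, new in zip(best["feature"], feature)]
            best["hits"] += 1
            return
        # Discard: when full, evict the least-used entry (oldest on ties).
        if len(self.entries) >= self.capacity:
            obsolete = min(range(len(self.entries)),
                           key=lambda i: self.entries[i]["hits"])
            self.entries.pop(obsolete)
        self.entries.append({"feature": list(feature), "hits": 1})


bank = FeatureBank(capacity=2)
for vec in ([1.0, 0.0], [0.0, 1.0], [1.0, 0.01], [-1.0, 0.0]):
    bank.absorb(vec)
```

Merging near-duplicates keeps the bank from growing with every frame, while eviction bounds memory regardless of video length.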
Liang Qian, Xiwen Zeng, Xiaorong Liu et al.
Correlated Color Temperature (CCT) significantly influences mood, comfort, and potentially overall health. However, its impact on visitors’ visual experience in museum design remains insufficiently explored. This study aims to investigate the effects of different CCT settings (3000 K, 4500 K, 6000 K) on visual comfort within a simulated museum space. Using 3D modeling and physiological recordings, 200 participants assessed visual comfort. Consistent findings support that a CCT of 4500 K provides the highest comfort level, aligning with the observed trend in eye gaze duration. Pupil diameter variability indicates that greater comfort is associated with higher CCT values. While differences in heart rate variability (HRV) were not statistically significant, there is a tendency for HRV to increase with longer fixation durations. These findings challenge literature advocating for lower CCT values in museum lighting, emphasizing the need to balance conservation and visitor experience. This study provides empirical evidence supporting the optimization of visual comfort in museum lighting design through a CCT value of 4500 K, offering valuable insights for practitioners. However, limitations include potential scene disturbance and the simulated environment. Future studies should diversify samples and explore a broader range of CCT values.
Whatley, Katherine G.T.
This article tells the story of a diasporic Jewish family across generations, continents, and languages through a shared name—Katherine—showing how names serve as talismans, linking present and past. Centered on the author’s grandmother, a Hungarian Holocaust survivor who lived in Europe and Australia, and the author, raised in Japan, it explores how Jewish names act as markers of memory, identity, politics, and religion. The author argues that Jewish naming rituals reflect the diasporic, cosmopolitan nature of prewar Jewish society. She examines tensions between assimilation and non-assimilation, secularism and mysticism, nationalism and cosmopolitanism, advocating for a renewed sense of multilingual, cosmopolitan Jewish identity. Drawing on Judaism, Buddhism, and esoteric mysticism, the author presents multilingualism and cosmopolitanism as inherent strengths of Jewish diasporic life—and as vital in today’s world. Through her own translational upbringing and family history, she offers a deeply personal narrative intertwined with 20th-century upheavals and calls for a revival of prewar Jewish cosmopolitanism.
Elisa Dávila Arreza
Haoyu Lu, Mingyu Ding, Yuqi Huo et al.
Large-scale vision-language pre-trained models have shown promising transferability to various downstream tasks. As the size of these foundation models and the number of downstream tasks grow, the standard full fine-tuning paradigm becomes unsustainable due to heavy computational and storage costs. This paper proposes UniAdapter, which unifies unimodal and multimodal adapters for parameter-efficient cross-modal adaptation on pre-trained vision-language models. Specifically, adapters are distributed to different modalities and their interactions, with the total number of tunable parameters reduced by partial weight sharing. The unified and knowledge-sharing design enables powerful cross-modal representations that can benefit various downstream tasks, requiring only 1.0%-2.0% of the pre-trained model's parameters to be tuned. Extensive experiments on 6 cross-modal downstream benchmarks (including video-text retrieval, image-text retrieval, VideoQA, and VQA) show that in most cases, UniAdapter not only outperforms the state of the art, but even beats the full fine-tuning strategy. In particular, on the MSRVTT retrieval task, UniAdapter achieves 49.7% recall@1 with 2.2% of the model parameters, outperforming the latest competitors by 2.0%. The code and models are available at https://github.com/RERV/UniAdapter.
Junbin Xiao, Pan Zhou, Angela Yao et al.
We propose to perform video question answering (VideoQA) in a contrastive manner via a Video Graph Transformer model (CoVGT). CoVGT's uniqueness and superiority are three-fold: 1) It proposes a dynamic graph transformer module which encodes video by explicitly capturing the visual objects, their relations and dynamics, for complex spatio-temporal reasoning. 2) It designs separate video and text transformers for contrastive learning between the video and text to perform QA, instead of a multi-modal transformer for answer classification. Fine-grained video-text communication is done by additional cross-modal interaction modules. 3) It is optimized by joint fully- and self-supervised contrastive objectives between the correct and incorrect answers, as well as the relevant and irrelevant questions, respectively. With superior video encoding and QA solution, we show that CoVGT achieves much better performance than prior art on video reasoning tasks. Its performance even surpasses that of models pretrained on millions of external samples. We further show that CoVGT can also benefit from cross-modal pretraining, yet with orders of magnitude less data. The results demonstrate the effectiveness and superiority of CoVGT, and additionally reveal its potential for more data-efficient pretraining.
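The contrastive formulation of answering, as opposed to answer classification, can be sketched generically. The abstract does not give CoVGT's exact objective, so this assumes a standard softmax over dot-product similarities between one video embedding and several candidate-answer embeddings; all names and the toy vectors are hypothetical.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def answer_probabilities(video_vec, answer_vecs, temperature=1.0):
    """Softmax over video-answer similarities (numerically stabilised)."""
    scores = [dot(video_vec, ans) / temperature for ans in answer_vecs]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def contrastive_loss(video_vec, answer_vecs, correct_index, temperature=1.0):
    """Negative log-probability of the correct answer among the candidates."""
    probs = answer_probabilities(video_vec, answer_vecs, temperature)
    return -math.log(probs[correct_index])


# Toy embeddings: the first candidate is aligned with the video vector.
video = [1.0, 0.0]
candidates = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
probs = answer_probabilities(video, candidates)
predicted = max(range(len(probs)), key=lambda i: probs[i])
```

Minimising this loss pulls the correct answer's embedding toward the video embedding and pushes incorrect answers away, so prediction reduces to picking the most similar candidate.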
Qiangsheng Hu, Peihong Yang, Jiajun Ma et al.
Intangible cultural heritage represents a cultural evolution shaped by human responses to adapting and transforming environments. Unveiling the characteristics and patterns of its spatial distribution can offer a more scientific foundation for the protection of intangible cultural heritage and the development of heritage tourism. By using ArcGIS software and Geo-detector, the regional differentiation characteristics and influencing mechanisms of Intangible Cultural Heritage, specifically the Hometown of Chinese Folk Culture and Art (HCFCA), have been investigated. Results show that the distribution and structure of HCFCA in China vary significantly across regions and categories. Eastern China holds the highest number, while Southern China has fewer. Traditional fine arts and dance dominate, with folk literature being less represented. HCFCA exhibits a clear clustering trend, reflecting a “Polycentric Agglomeration” pattern, particularly in traditional drama, calligraphy, craftsmanship, sports, performing arts, and acrobatics. The study presents an influential mechanism model, with the natural environment as foundational, economic development as the primary driver, social development as supplementary support, and historical background as a catalyzing factor. This understanding not only enriches theoretical insights but also provides a scientific basis for the protection of intangible cultural heritage and the development of heritage tourism.
Kiraniawati Telaumbanua, Berkati Bu’ulolo
Art learning is a form of early childhood education that focuses on developing visual intelligence. In this context, children are encouraged to develop the ability to understand objects thoroughly and in detail. The purpose of this research is to help teachers understand that fine arts are highly beneficial for early childhood. The study uses a descriptive qualitative method, reviewing data from case studies, literature studies, and books as references. The results indicate that fine arts in early childhood can develop children's fine motor skills, foster imagination and creativity, improve problem-solving skills, and increase emotional intelligence.
Valerie Fraser
Ali Hassan Abdellah Mohamed Eldaly
The art of animation is always moving towards creativity and distinction, and animation artists continue to bring joy and happiness to audiences of adults and children through many kinds of work and techniques, whether two-dimensional or three-dimensional animation, clay, cut-out, or frame-by-frame puppet animation. Its production forms range from feature-length narrative films and short films to serials, promotional advertisements, and animated breaks shown on satellite channels. This research sheds light on a short animated break shown on a satellite channel during the holy month of Ramadan, which uses a two-dimensional animation technique to implement rhythmic interval animation. The research also examines how this rhythmic interval develops the rhythmic and creative role of two-dimensional movement through a rhythmic musical piece, a song played on strings with a vocalist. The song comes from the Egyptian heritage, which is full of traditional songs that have stuck in the minds and hearts of Egyptian people, old and young. It was composed by Ahmed Abd El-Qader (born in 1916), the Egyptian artist and singer who was present at the opening of Egyptian Radio in 1934 and was one of the first singers to perform in its programs from the first week. Rhythmic music plays a role in highlighting and intensifying the dramatic event in animated films, so there must be a dynamic connection between what appears within the picture frame and what the audience hears, since music that is not synchronized with the movement may produce a dramatically negative result in animated breaks or films. This relationship must therefore be translated into the work: the interconnectedness of rhythmic movement and music carries the animation toward the dramatic through the perception of the animator.
Inés Magdalena Campos-García Calderón, Doraliza Olivera Mendoza
Neighborhood Unit No. 3 (Unidad Vecinal n.° 3, UV3) was a social housing scheme based on the Neighborhood-Unit and Satellite-City theories, in which Planned Open Spaces (Espacios Libres Planificados, ELP) were important for the health and community development of the population; consequently, they were occupied and their physical and architectural characteristics were transformed. The objective of this research was to identify the transformation by appropriation of the ELP in UV3. Using a qualitative approach, a graphic comparative analysis of the original scheme versus the current situation was carried out, together with documentary analysis and field observation. Changes were found in the placement of material elements to delimit and subdivide spaces and the incorporation of these spaces into the residential space, in changes of use from collective green area to individual space of the adjoining dwelling, and in the placement of symbolic elements of recognition. These changes result from different types of appropriation: according to the agent, the nature, and the consequences. It is concluded that the transformation of the ELP of the original scheme was made possible by the excessive size of their areas and resulted from different forms of appropriation that have generated an informal urban profile.
Anilore Banon
Carl Mika
Māori philosophy is at an exciting point as it looks to other sources for inspiration. In this paper, I refer to some key Māori concepts and terms with Spinoza’s notion of primordial substance in mind. Some Māori terms such as ira (the manifestation and persistence of a thing), whakaaro (indebtedness to a primordial substance) and Papatūānuku (primordial substance) are relevant here. I do not seek to compare Spinoza and Māori thought as such but instead to work with Māori concepts and terms with Spinoza in the background.
Jesús Algovi González Villegas
Video work, shown in a loop. It is the recording of an action on a sculptural piece made of wax bearing the word ACCIÓN. The work begins its “cremation”, like a multiple candle, until it disappears completely (accelerated camera or time-lapse), and then rises again from its ashes until it is formed and complete once more. It is a visual metaphor for life itself, for the rebirth, like a phoenix, that we often have to experience throughout our existence. This piece has been exhibited in solo shows at the Galería Weber-Lutgen in Seville (“Game Over”, 2018) and at the international contemporary art fair CHINA CHENGDU INTERNATIONAL ARTWORK EXPO (People's Republic of China, 2015). Presented at the I CONGRESO VIRTUAL INTERNACIONAL CIVARTES 2020.