Results for "Art"

Showing 20 of ~3,678,776 results · from CrossRef, DOAJ, arXiv, Semantic Scholar

S2 Open Access 2021
High-Resolution Image Synthesis with Latent Diffusion Models

Robin Rombach, A. Blattmann, Dominik Lorenz et al.

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a guiding mechanism to control the image generation process without retraining. However, since these models typically operate directly in pixel space, optimization of powerful DMs often consumes hundreds of GPU days and inference is expensive due to sequential evaluations. To enable DM training on limited computational resources while retaining their quality and flexibility, we apply them in the latent space of powerful pretrained autoencoders. In contrast to previous work, training diffusion models on such a representation allows for the first time to reach a near-optimal point between complexity reduction and detail preservation, greatly boosting visual fidelity. By introducing cross-attention layers into the model architecture, we turn diffusion models into powerful and flexible generators for general conditioning inputs such as text or bounding boxes and high-resolution synthesis becomes possible in a convolutional manner. Our latent diffusion models (LDMs) achieve new state of the art scores for image inpainting and class-conditional image synthesis and highly competitive performance on various tasks, including unconditional image generation, text-to-image synthesis, and super-resolution, while significantly reducing computational requirements compared to pixel-based DMs.
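The core idea above, running diffusion in a compressed latent space rather than in pixel space, can be sketched schematically. This is a toy illustration of ours, not the authors' code; `encode` and the noise schedule are simplified stand-ins for the pretrained autoencoder and a real diffusion schedule:

```python
import numpy as np

def encode(x):
    """Stand-in for the pretrained autoencoder's encoder: 4x compression."""
    return x.reshape(-1, 4).mean(axis=1)

def noise(z0, t, alpha_bar, rng):
    """Forward diffusion step: z_t = sqrt(abar_t)*z_0 + sqrt(1-abar_t)*eps."""
    eps = rng.normal(size=z0.shape)
    return np.sqrt(alpha_bar[t]) * z0 + np.sqrt(1.0 - alpha_bar[t]) * eps

x = np.linspace(0.0, 1.0, 64)             # toy 64-"pixel" image
z0 = encode(x)                            # the denoiser now works on 16 dims
alpha_bar = np.linspace(0.999, 0.01, 10)  # toy noise schedule
z_t = noise(z0, t=5, alpha_bar=alpha_bar, rng=np.random.default_rng(0))
```

The denoising network then only ever sees the low-dimensional `z`, which is where the computational savings come from.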

23,546 citations · en · Computer Science
S2 Open Access 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Colin Raffel, Noam Shazeer, Adam Roberts et al.

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP). The effectiveness of transfer learning has given rise to a diversity of approaches, methodology, and practice. In this paper, we explore the landscape of transfer learning techniques for NLP by introducing a unified framework that converts every language problem into a text-to-text format. Our systematic study compares pre-training objectives, architectures, unlabeled datasets, transfer approaches, and other factors on dozens of language understanding tasks. By combining the insights from our exploration with scale and our new "Colossal Clean Crawled Corpus", we achieve state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more. To facilitate future work on transfer learning for NLP, we release our dataset, pre-trained models, and code.
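The unified text-to-text framing is simple to state in code: every task becomes a pair of a prefixed input string and a target string. A minimal sketch, using task prefixes of the kind the paper describes (the helper name is ours):

```python
def to_text_to_text(task_prefix: str, text: str) -> str:
    """Cast any NLP task input into plain text with a task prefix."""
    return f"{task_prefix}: {text}"

# Translation and classification alike become string pairs, so one model
# with one maximum-likelihood objective handles them all.
pairs = [
    (to_text_to_text("translate English to German", "That is good."), "Das ist gut."),
    (to_text_to_text("cola sentence", "The course is jumping well."), "not acceptable"),
]
```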

25,109 citations · en · Mathematics, Computer Science
S2 Open Access 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Nils Reimers, Iryna Gurevych

BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) have set a new state-of-the-art performance on sentence-pair regression tasks like semantic textual similarity (STS). However, they require that both sentences be fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computations (~65 hours) with BERT. The construction of BERT makes it unsuitable for semantic similarity search as well as for unsupervised tasks like clustering. In this publication, we present Sentence-BERT (SBERT), a modification of the pretrained BERT network that uses siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine similarity. This reduces the effort for finding the most similar pair from 65 hours with BERT / RoBERTa to about 5 seconds with SBERT, while maintaining the accuracy of BERT. We evaluate SBERT and SRoBERTa on common STS tasks and transfer learning tasks, where they outperform other state-of-the-art sentence embedding methods.
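The speed-up comes from encoding each sentence once and comparing fixed vectors with cosine similarity, instead of running a full cross-encoder pass per pair. A sketch with hypothetical toy vectors standing in for real SBERT embeddings:

```python
import numpy as np

def cosine_sim(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two sentence embeddings."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy embeddings (hypothetical, not real model output). Each sentence is
# encoded once; the most similar pair is then found with pairwise cosine
# over n vectors instead of n^2 full BERT cross-encoder passes.
emb = {
    "a cat sits on the mat":   np.array([0.9, 0.1, 0.0]),
    "a kitten rests on a rug": np.array([0.8, 0.2, 0.1]),
    "the stock market fell":   np.array([0.0, 0.1, 0.95]),
}
sents = list(emb)
best = max(
    ((s, t) for i, s in enumerate(sents) for t in sents[i + 1:]),
    key=lambda p: cosine_sim(emb[p[0]], emb[p[1]]),
)
```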

17,054 citations · en · Computer Science
S2 Open Access 2019
Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, S. Laine, M. Aittala et al.

The style-based GAN architecture (StyleGAN) yields state-of-the-art results in data-driven unconditional generative image modeling. We expose and analyze several of its characteristic artifacts, and propose changes in both model architecture and training methods to address them. In particular, we redesign the generator normalization, revisit progressive growing, and regularize the generator to encourage good conditioning in the mapping from latent codes to images. In addition to improving image quality, this path length regularizer yields the additional benefit that the generator becomes significantly easier to invert. This makes it possible to reliably attribute a generated image to a particular network. We furthermore visualize how well the generator utilizes its output resolution, and identify a capacity problem, motivating us to train larger models for additional quality improvements. Overall, our improved model redefines the state of the art in unconditional image modeling, both in terms of existing distribution quality metrics as well as perceived image quality.

6,857 citations · en · Computer Science, Engineering
S2 Open Access 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Mingxing Tan, Quoc V. Le

Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet. To go even further, we use neural architecture search to design a new baseline network and scale it up to obtain a family of models, called EfficientNets, which achieve much better accuracy and efficiency than previous ConvNets. In particular, our EfficientNet-B7 achieves state-of-the-art 84.4% top-1 / 97.1% top-5 accuracy on ImageNet, while being 8.4x smaller and 6.1x faster on inference than the best existing ConvNet. Our EfficientNets also transfer well and achieve state-of-the-art accuracy on CIFAR-100 (91.7%), Flowers (98.8%), and 3 other transfer learning datasets, with an order of magnitude fewer parameters. Source code is at this https URL.
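The compound scaling rule can be written down directly: depth, width, and resolution scale as alpha^phi, beta^phi, gamma^phi, with alpha * beta^2 * gamma^2 ≈ 2 so that total FLOPs grow by roughly 2^phi. The helper below is our sketch; the coefficient values are the ones reported for EfficientNet:

```python
# Compound-scaling coefficients from the EfficientNet grid search.
ALPHA, BETA, GAMMA = 1.2, 1.1, 1.15

def compound_scale(phi: float) -> tuple[float, float, float]:
    """Return (depth, width, resolution) multipliers for coefficient phi."""
    return ALPHA ** phi, BETA ** phi, GAMMA ** phi

d, w, r = compound_scale(2)
# FLOPs scale roughly with depth * width^2 * resolution^2,
# so phi = 2 gives about a 4x FLOPs increase (2^phi).
flops_factor = d * w * w * r * r
```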

23,199 citations · en · Computer Science, Mathematics
S2 Open Access 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, Ming-Wei Chang, Kenton Lee et al.

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models (Peters et al., 2018a; Radford et al., 2018), BERT is designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications. BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural language processing tasks, including pushing the GLUE score to 80.5 (7.7 point absolute improvement), MultiNLI accuracy to 86.7% (4.6% absolute improvement), SQuAD v1.1 question answering Test F1 to 93.2 (1.5 point absolute improvement) and SQuAD v2.0 Test F1 to 83.1 (5.1 point absolute improvement).

112,729 citations · en · Computer Science
S2 Open Access 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach

Yinhan Liu, Myle Ott, Naman Goyal et al.

Language model pretraining has led to significant performance gains but careful comparison between different approaches is challenging. Training is computationally expensive, often done on private datasets of different sizes, and, as we will show, hyperparameter choices have significant impact on the final results. We present a replication study of BERT pretraining (Devlin et al., 2019) that carefully measures the impact of many key hyperparameters and training data size. We find that BERT was significantly undertrained, and can match or exceed the performance of every model published after it. Our best model achieves state-of-the-art results on GLUE, RACE and SQuAD. These results highlight the importance of previously overlooked design choices, and raise questions about the source of recently reported improvements. We release our models and code.

29,091 citations · en · Computer Science
S2 Open Access 2018
How Powerful are Graph Neural Networks?

Keyulu Xu, Weihua Hu, J. Leskovec et al.

Graph Neural Networks (GNNs) are an effective framework for representation learning of graphs. GNNs follow a neighborhood aggregation scheme, where the representation vector of a node is computed by recursively aggregating and transforming representation vectors of its neighboring nodes. Many GNN variants have been proposed and have achieved state-of-the-art results on both node and graph classification tasks. However, despite GNNs revolutionizing graph representation learning, there is limited understanding of their representational properties and limitations. Here, we present a theoretical framework for analyzing the expressive power of GNNs to capture different graph structures. Our results characterize the discriminative power of popular GNN variants, such as Graph Convolutional Networks and GraphSAGE, and show that they cannot learn to distinguish certain simple graph structures. We then develop a simple architecture that is provably the most expressive among the class of GNNs and is as powerful as the Weisfeiler-Lehman graph isomorphism test. We empirically validate our theoretical findings on a number of graph classification benchmarks, and demonstrate that our model achieves state-of-the-art performance.
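The role of the aggregator is easy to see on a toy multiset: mean aggregation cannot tell two identical neighbours from three, while the sum aggregation used by the provably most expressive architecture the paper derives can (our illustration, not the paper's code):

```python
import numpy as np

# Two neighbourhoods that differ only in multiplicity.
nbrs_a = np.array([[1.0], [1.0]])           # two neighbours with feature 1
nbrs_b = np.array([[1.0], [1.0], [1.0]])    # three neighbours with feature 1

# Mean aggregation collapses both to the same vector; sum keeps them apart,
# which is what lets the architecture match the Weisfeiler-Lehman test.
mean_a, mean_b = nbrs_a.mean(axis=0), nbrs_b.mean(axis=0)
sum_a, sum_b = nbrs_a.sum(axis=0), nbrs_b.sum(axis=0)
```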

9,491 citations · en · Computer Science, Mathematics
S2 Open Access 2017
mixup: Beyond Empirical Risk Minimization

Hongyi Zhang, Moustapha Cissé, Yann Dauphin et al.

Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the neural network to favor simple linear behavior in-between training examples. Our experiments on the ImageNet-2012, CIFAR-10, CIFAR-100, Google commands and UCI datasets show that mixup improves the generalization of state-of-the-art neural network architectures. We also find that mixup reduces the memorization of corrupt labels, increases the robustness to adversarial examples, and stabilizes the training of generative adversarial networks.
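The mixup rule itself is one line: x = lam*x_i + (1-lam)*x_j, with labels blended the same way and lam drawn from a Beta(alpha, alpha) distribution. A minimal sketch:

```python
import numpy as np

def mixup(x1, y1, x2, y2, lam):
    """Blend two examples: x = lam*x1 + (1-lam)*x2, same for labels."""
    x = lam * x1 + (1 - lam) * x2
    y = lam * y1 + (1 - lam) * y2
    return x, y

rng = np.random.default_rng(0)
lam = rng.beta(0.2, 0.2)                  # lambda ~ Beta(alpha, alpha)
x1, x2 = np.ones(4), np.zeros(4)          # toy "images"
y1 = np.array([1.0, 0.0])                 # one-hot labels
y2 = np.array([0.0, 1.0])
x_mix, y_mix = mixup(x1, y1, x2, y2, lam)
```

Because the mixed label is a convex combination, it still sums to 1 and can be used directly with a cross-entropy loss.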

11,536 citations · en · Mathematics, Computer Science
S2 Open Access 2017
Graph Attention Networks

Petar Velickovic, Guillem Cucurull, Arantxa Casanova et al.

We present graph attention networks (GATs), novel neural network architectures that operate on graph-structured data, leveraging masked self-attentional layers to address the shortcomings of prior methods based on graph convolutions or their approximations. By stacking layers in which nodes are able to attend over their neighborhoods' features, we enable (implicitly) specifying different weights to different nodes in a neighborhood, without requiring any kind of costly matrix operation (such as inversion) or depending on knowing the graph structure upfront. In this way, we address several key challenges of spectral-based graph neural networks simultaneously, and make our model readily applicable to inductive as well as transductive problems. Our GAT models have achieved or matched state-of-the-art results across four established transductive and inductive graph benchmarks: the Cora, Citeseer and Pubmed citation network datasets, as well as a protein-protein interaction dataset (wherein test graphs remain unseen during training).
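One attention head of the kind described can be sketched in a few lines of numpy (our simplified single-head version, no bias): attention logits are computed for neighbour pairs only, softmax-normalised per neighbourhood, and used to weight the aggregation:

```python
import numpy as np

def gat_layer(H, A, W, a):
    """H: node features (n,d); A: adjacency with self-loops (n,n);
    W: projection (d,k); a: attention vector (2k,)."""
    Z = H @ W                                  # projected features (n,k)
    n = Z.shape[0]
    # e_ij = LeakyReLU(a^T [z_i || z_j]) for every pair (i, j)
    logits = np.array([[np.concatenate([Z[i], Z[j]]) @ a for j in range(n)]
                       for i in range(n)])
    logits = np.where(logits > 0, logits, 0.2 * logits)  # LeakyReLU, slope 0.2
    logits = np.where(A > 0, logits, -1e9)               # mask non-edges
    alpha = np.exp(logits - logits.max(axis=1, keepdims=True))
    alpha /= alpha.sum(axis=1, keepdims=True)            # softmax per node
    return alpha @ Z                                     # weighted aggregation

rng = np.random.default_rng(1)
H = rng.normal(size=(4, 3))
A = np.array([[1, 1, 0, 0], [1, 1, 1, 0], [0, 1, 1, 1], [0, 0, 1, 1]], float)
W = rng.normal(size=(3, 2))
a = rng.normal(size=(4,))
out = gat_layer(H, A, W, a)
```

Because the mask only needs the adjacency structure of the current batch, the same layer applies to graphs unseen at training time, which is what makes the model inductive.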

25,490 citations · en · Computer Science, Mathematics
S2 Open Access 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

João Carreira, Andrew Zisserman

The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it difficult to identify good video architectures, as most methods obtain similar performance on existing small-scale benchmarks. This paper re-evaluates state-of-the-art architectures in light of the new Kinetics Human Action Video dataset. Kinetics has two orders of magnitude more data, with 400 human action classes and over 400 clips per class, and is collected from realistic, challenging YouTube videos. We provide an analysis on how current architectures fare on the task of action classification on this dataset and how much performance improves on the smaller benchmark datasets after pre-training on Kinetics. We also introduce a new Two-Stream Inflated 3D ConvNet (I3D) that is based on 2D ConvNet inflation: filters and pooling kernels of very deep image classification ConvNets are expanded into 3D, making it possible to learn seamless spatio-temporal feature extractors from video while leveraging successful ImageNet architecture designs and even their parameters. We show that, after pre-training on Kinetics, I3D models considerably improve upon the state-of-the-art in action classification, reaching 80.2% on HMDB-51 and 97.9% on UCF-101.
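The 2D-to-3D inflation trick is mechanical: repeat a pretrained 2D kernel along a new temporal axis and rescale, so that a "boring" video of identical frames produces the same response the 2D filter gave on one frame (our toy version, not the authors' code):

```python
import numpy as np

def inflate_kernel(k2d: np.ndarray, t: int) -> np.ndarray:
    """Repeat a (kh, kw) kernel t times along time and divide by t."""
    return np.repeat(k2d[None, :, :], t, axis=0) / t

k2d = np.arange(9, dtype=float).reshape(3, 3)   # a "pretrained" 2D filter
k3d = inflate_kernel(k2d, t=4)

# On a video made of one repeated frame, the inflated filter's response
# over its temporal window equals the original 2D response on that frame,
# which is why ImageNet-pretrained weights transfer directly.
frame = np.ones((3, 3))
video = np.repeat(frame[None], 4, axis=0)
resp_3d = (k3d * video).sum()
resp_2d = (k2d * frame).sum()
```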

9,346 citations · en · Computer Science
S2 Open Access 2017
Attention is All you Need

Ashish Vaswani, Noam Shazeer, Niki Parmar et al.

The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
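The mechanism the architecture is built on is scaled dot-product attention, Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V. A minimal numpy sketch of ours:

```python
import numpy as np

def attention(Q, K, V):
    """Scaled dot-product attention over one head."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # query-key similarities
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V                            # convex mix of values

rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))   # 3 queries, d_k = 4
K = rng.normal(size=(5, 4))   # 5 keys
V = rng.normal(size=(5, 2))   # 5 values, d_v = 2
out = attention(Q, K, V)
```

All positions are processed in one matrix product rather than sequentially, which is the source of the parallelism the abstract highlights.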

172,412 citations · en · Computer Science
S2 Open Access 2017
Enhanced Deep Residual Networks for Single Image Super-Resolution

Bee Lim, Sanghyun Son, Heewon Kim et al.

Recent research on super-resolution has progressed with the development of deep convolutional neural networks (DCNN). In particular, residual learning techniques exhibit improved performance. In this paper, we develop an enhanced deep super-resolution network (EDSR) with performance exceeding those of current state-of-the-art SR methods. The significant performance improvement of our model is due to optimization by removing unnecessary modules in conventional residual networks. The performance is further improved by expanding the model size while we stabilize the training procedure. We also propose a new multi-scale deep super-resolution system (MDSR) and training method, which can reconstruct high-resolution images of different upscaling factors in a single model. The proposed methods show superior performance over the state-of-the-art methods on benchmark datasets, proving their excellence by winning the NTIRE2017 Super-Resolution Challenge[26].

7,032 citations · en · Computer Science
S2 Open Access 2011
Scikit-learn: Machine Learning in Python

Fabian Pedregosa, G. Varoquaux, Alexandre Gramfort et al.

Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from http://scikit-learn.sourceforge.net.
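The API consistency the abstract emphasises means most estimators share the same `fit`/`predict`/`score` interface. A minimal example using the current package name (`sklearn`):

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Load a bundled dataset, split it, fit an estimator, and score it:
# the same three-call pattern applies to nearly every scikit-learn model.
X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
acc = clf.score(X_te, y_te)   # mean accuracy on the held-out split
```

Swapping `LogisticRegression` for any other classifier leaves the rest of the code unchanged, which is the "API consistency" the abstract refers to.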

87,694 citations · en · Computer Science
DOAJ Open Access 2025
Effect of Short-Term Storage in Modified Atmosphere Packaging (MAP) and Controlled Atmosphere (CA) on Total Polyphenol Content and Antioxidant Activity in Juices from Haskap Berry (<i>Lonicera caerulea</i> L.)

Barbara Anna Kowalczyk, Monika Bieniasz, Jan Błaszczyk

This article focuses on analysing the properties of six varieties of haskap berry (honeyberry) as a valuable raw material for producing health-promoting juices. Significant differences in the content of bioactive compounds were observed between juices derived from fruits of the same species. This study demonstrated that controlled atmosphere (CA) conditions (20% CO<sub>2</sub> and 5% O<sub>2</sub>) and modified atmosphere packaging (MAP) in Xtend bags affected juice quality by minimising nutritional losses. The analysis of polyphenol content in the juices revealed significant differences between varieties and years (2021 and 2022), primarily due to varying weather conditions. In 2022, the polyphenol content of the varieties ‘Usłada’, ‘Candy Blue’, ‘Boreal Beauty’, and ‘Boreal Beast’ was from 69% higher to twice as high as the values recorded in 2021. CA and MAP storage conditions were found to be more effective than normal atmosphere (NA) in preserving bioactive components, and thus the antioxidant activity of the fruits, as measured by the DPPH method. The variety ‘Sinij Utes’ had the highest total polyphenol contents and their lowest loss during storage. Conversely, the variety ‘Boreal Beauty’ contained the lowest polyphenol levels both after harvest and storage. This study confirmed the importance of proper storage conditions for maintaining the antioxidant properties of haskap berries.

Agriculture (General)
DOAJ Open Access 2025
What types of tobacco control public service advertisements work for Chinese adolescents? A mixed-methods study

Yu Chen, Haoyi Liu, Shiyu Liu et al.

Introduction Adolescent tobacco use has become a serious global public health problem, and effective tobacco control public service advertisements (PSAs) are crucial for reducing adolescent smoking rates. This study employs a mixed-methods approach combining quantitative surveys and qualitative focus groups to evaluate the effectiveness of different types of tobacco control PSAs among Chinese adolescents, identify effective advertising characteristics and content elements, and provide empirical evidence for optimizing youth tobacco control communication strategies. Methods A total of 125 students aged 10–18 years were recruited from six primary and secondary schools in Beijing and Kunming from November 2020 to April 2021. Participants completed Likert-scale ratings measuring advertisement effectiveness after viewing eight tobacco control PSAs and participated in focus group interviews. Quantitative data were analyzed using independent samples t-tests, Spearman correlation analysis, and multivariable logistic regression analysis, while qualitative data were analyzed using thematic analysis. All statistical tests were two-tailed with significance set at p<0.05. Results Quantitative analysis revealed that PSAs employing ‘testimonials’ and ‘disease’ frameworks were most strongly associated with prevention intentions, while those using ‘celebrity endorsement’, ‘humor’ and ‘appearance damage’ frameworks showed the weakest associations. Kunming adolescents showed significantly higher advertisement acceptance scores than Beijing adolescents (mean difference=0.21; 95% CI: 0.04–0.38, p<0.05). The 10-item effectiveness scale demonstrated good internal consistency (Cronbach’s α=0.82). Qualitative analysis identified effective characteristics including presentation of specific health hazards, use of testimonials, and fear appeals; ineffective characteristics included non-specific harm presentation, use of humorous elements, and appearance damage content.
Conclusions Tobacco control PSA design should consider strategies combining disease warnings with real-life testimonials, avoid humorous advertisements and industry-sponsored messaging, and consider regional cultural differences. Distribution through online and social media platforms frequently used by adolescents may enhance reach. Future longitudinal research with broader geographical sampling is needed to confirm these findings.

Diseases of the respiratory system, Neoplasms. Tumors. Oncology. Including cancer and carcinogens

Page 26 of 183,939