Apple – Page 60 – Vedere AI

Prompting for a Conversation: How to Control a Dialog Model?

October 24, 2022

by Apple

Dialog modelling faces a difficult trade-off. Models are trained on a large amount of text, yet their responses need to be limited to a desired scope and style of a dialog agent. Because the datasets used to achieve the former contain language that is not compatible with the latter, pre-trained dialog models are fine-tuned on smaller curated datasets. However, the fine-tuning process robs them of the ability to produce diverse responses, eventually reducing them to dull conversation partners. In this paper we investigate if prompting can mitigate the above trade-off. Specifically, we…Apple Machine Learning Research

A Treatise On FST Lattice Based MMI Training

October 24, 2022

by Apple

Maximum mutual information (MMI) has become one of the two de facto methods for sequence-level training of speech recognition acoustic models. This paper aims to isolate, identify and bring forward the implicit modelling decisions induced by the design implementation of standard finite state transducer (FST) lattice based MMI training framework. The paper particularly investigates the necessity to maintain a preselected numerator alignment and raises the importance of determinizing FST denominator lattices on the fly. The efficacy of employing on the fly FST lattice determinization is…Apple Machine Learning Research

Non-Autoregressive Neural Machine Translation: A Call for Clarity

October 24, 2022

by Apple

Non-autoregressive approaches aim to improve the inference speed of translation models by only requiring a single forward pass to generate the output sequence instead of iteratively producing each predicted token. Consequently, their translation quality still tends to be inferior to their autoregressive counterparts due to several issues involving output token interdependence. In this work, we take a step back and revisit several techniques that have been proposed for improving non-autoregressive translation models and compare their combined translation quality and speed implications under…Apple Machine Learning Research

Latent Temporal Flows for Multivariate Analysis of Wearables Data

October 18, 2022

by Apple

Increased use of sensor signals from wearable devices as rich sources of physiological data has sparked growing interest in developing health monitoring systems to identify changes in an individual’s health profile. Indeed, machine learning models for sensor signals have enabled a diverse range of healthcare related applications including early detection of abnormalities, fertility tracking, and adverse drug effect prediction. However, these models can fail to account for the dependent high-dimensional nature of the underlying sensor signals. In this paper, we introduce Latent Temporal Flows…Apple Machine Learning Research

SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks

October 13, 2022

by Apple

Recent isotropic networks, such as ConvMixer and vision transformers, have found significant success across visual recognition tasks, matching or outperforming non-isotropic convolutional neural networks (CNNs). Isotropic architectures are particularly well-suited to cross-layer weight sharing, an effective neural network compression technique. In this paper, we perform an empirical evaluation on methods for sharing parameters in isotropic networks (SPIN). We present a framework to formalize major weight sharing design decisions and perform a comprehensive empirical evaluation of this design…Apple Machine Learning Research

The 2023 AI/ML Residency Program Application is now Open

October 12, 2022

by Apple

Apple Machine Learning Research

ECCV 2022

October 11, 2022

by Apple

Apple Machine Learning Research

The Calibration Generalization Gap

October 11, 2022

by Apple

This paper was accepted at the Workshop on Distribution-Free Uncertainty Quantification at ICML 2022.
Calibration is a fundamental property of a good predictive model: it requires that the model predicts correctly in proportion to its confidence. Modern neural networks, however, provide no strong guarantees on their calibration— and can be either poorly calibrated or well-calibrated depending on the setting. It is currently unclear which factors contribute to good calibration (architecture, data augmentation, overparameterization, etc), though various claims exist in the literature. We…Apple Machine Learning Research

Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures

October 11, 2022

by Apple

his paper considers the Pointer Value Retrieval (PVR) benchmark introduced in [ZRKB21], where a `reasoning’ function acts on a string of digits to produce the label. More generally, the paper considers the learning of logical functions with gradient descent (GD) on neural networks. It is first shown that in order to learn logical functions with gradient descent on symmetric neural networks, the generalization error can be lower-bounded in terms of the noise-stability of the target function, supporting a conjecture made in [ZRKB21]. It is then shown that in the distribution shift setting, when…Apple Machine Learning Research

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

October 11, 2022

by Apple

The perception system in personalized mobile agents requires developing indoor scene understanding models, which can understand 3D geometries, capture objectiveness, analyze human behaviors, etc. Nonetheless, this direction has not been well-explored in comparison with models for outdoor environments (e.g., the autonomous driving system that includes pedestrian prediction, car detection, traffic sign recognition, etc.). In this paper, we first discuss the main challenge: insufficient, or even no, labeled data for real-world indoor environments, and other challenges such as fusion between…Apple Machine Learning Research

Vedere AI

Posts in category: Apple

Prompting for a Conversation: How to Control a Dialog Model?

A Treatise On FST Lattice Based MMI Training

Non-Autoregressive Neural Machine Translation: A Call for Clarity

Latent Temporal Flows for Multivariate Analysis of Wearables Data

SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks

The 2023 AI/ML Residency Program Application is now Open

ECCV 2022

The Calibration Generalization Gap

Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures

Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

Navigation

GenAI Vision Endless Possibilities

"I'm interested in things that change the world or that affect the future and wondrous, new technology where you see it, and you're like, 'Wow, how did that even happen? How is that possible?'" -- Elon Musk

Copyright © 2019-2025 Vedere AI. All Rights Reserved.