TensorFlow Serving

2017-08-09 20:32:22 +08:00
 fendouai_com
TensorFlow Serving is an open-source software library for serving machine learning models. It deals with the inference aspect of machine learning, taking models after training and managing their lifetimes, providing clients with versioned access via a high-performance, reference-counted lookup table.
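To make "reference-counted lookup table" a bit more concrete, here is a minimal sketch in plain Python. It is not TensorFlow Serving's actual implementation or API; the class `VersionedTable` and its methods `load`, `borrow`, `unload` and `versions` are hypothetical names made up for illustration. The idea shown: a version that has been asked to unload is only dropped once the last client holding it lets go.

```python
import threading
from contextlib import contextmanager


class VersionedTable:
    """Toy reference-counted lookup table keyed by (name, version).

    Hypothetical illustration only, not the TensorFlow Serving API.
    """

    def __init__(self):
        self._lock = threading.Lock()
        # (name, version) -> [object, refcount, retired_flag]
        self._entries = {}

    def load(self, name, version, obj):
        with self._lock:
            self._entries[(name, version)] = [obj, 0, False]

    def unload(self, name, version):
        """Retire a version; drop it now if idle, else when the last borrower exits."""
        with self._lock:
            entry = self._entries[(name, version)]
            entry[2] = True
            if entry[1] == 0:
                del self._entries[(name, version)]

    def versions(self, name):
        """Versions still held in the table, including retired ones in use."""
        with self._lock:
            return sorted(v for (n, v) in self._entries if n == name)

    @contextmanager
    def borrow(self, name, version=None):
        """Yield (version, object); defaults to the newest non-retired version."""
        with self._lock:
            if version is None:
                live = [v for (n, v), e in self._entries.items()
                        if n == name and not e[2]]
                if not live:
                    raise KeyError(name)
                version = max(live)
            entry = self._entries[(name, version)]
            if entry[2]:
                raise KeyError((name, version))  # retired: no new borrowers
            entry[1] += 1
        try:
            yield version, entry[0]
        finally:
            with self._lock:
                entry[1] -= 1
                if entry[2] and entry[1] == 0:
                    del self._entries[(name, version)]
```

Usage would look like `with table.borrow("my_model") as (version, model): ...`. Calling `unload` on that version while the `with` block is still running does not pull the object out from under the caller.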

Multiple models, or indeed multiple versions of the same model, can be served simultaneously. This flexibility facilitates canarying new versions, non-atomically migrating clients to new models or versions, and A/B testing experimental models.
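One common way to canary a new version or run an A/B test is to split traffic by weight. The sketch below is an assumption about how a client or front end might do that, not something TensorFlow Serving ships; `pick_version` and the `weights` mapping are hypothetical. Hashing the client id keeps each client pinned to one version while the weights stay the same, and shifting the weights over time gives the gradual (non-atomic) migration described above.

```python
import hashlib


def pick_version(client_id, weights):
    """Map a client id to a model version according to traffic weights.

    weights: dict of version -> fraction of traffic, expected to sum to 1.0.
    Hypothetical helper for illustration only.
    """
    digest = hashlib.sha256(client_id.encode("utf-8")).digest()
    # Turn the first 8 bytes of the hash into a stable point in [0, 1).
    point = int.from_bytes(digest[:8], "big") / 2**64
    cumulative = 0.0
    for version in sorted(weights):
        cumulative += weights[version]
        if point < cumulative:
            return version
    return max(weights)  # guard against floating-point rounding


# Send roughly 10% of clients to the canary version 2.
canary_weights = {1: 0.9, 2: 0.1}
chosen = pick_version("client-42", canary_weights)
```

The same client id always lands on the same version for a given set of weights, which makes it easier to compare results between the two groups.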

The primary use-case is high-performance production serving, but the same serving infrastructure can also be used in bulk-processing (e.g. map-reduce) jobs to pre-compute inference results or analyze model performance. In both scenarios, GPUs can substantially increase inference throughput. TensorFlow Serving comes with a scheduler that groups individual inference requests into batches for joint execution on a GPU, with configurable latency controls.
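The batching idea (group individual requests, but cap how long any one request waits) can be sketched with the standard library alone. This is only an illustration of the technique, not TensorFlow Serving's scheduler; `collect_batch`, `max_batch_size` and `batch_timeout_s` are names chosen for this example.

```python
import queue
import time


def collect_batch(requests, max_batch_size, batch_timeout_s):
    """Gather requests into one batch.

    Blocks until the first request arrives, then keeps collecting until
    the batch is full or batch_timeout_s has passed since that first
    request. The timeout is the latency control: a lone request never
    waits longer than that for company.
    """
    batch = [requests.get()]  # wait for at least one request
    deadline = time.monotonic() + batch_timeout_s
    while len(batch) < max_batch_size:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            break
        try:
            batch.append(requests.get(timeout=remaining))
        except queue.Empty:
            break
    return batch


# Five pending requests, batches of at most three.
pending = queue.Queue()
for i in range(5):
    pending.put(i)

first = collect_batch(pending, max_batch_size=3, batch_timeout_s=0.05)
second = collect_batch(pending, max_batch_size=3, batch_timeout_s=0.05)
```

Here `first` fills up immediately, while `second` holds only the two leftover requests and is released once the timeout expires. A bigger batch size helps GPU throughput; a shorter timeout helps tail latency.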

TensorFlow Serving has out-of-the-box support for TensorFlow models (naturally), but at its core it manages arbitrary versioned items (servables) with pass-through to their native APIs. In addition to trained TensorFlow models, servables can include other assets needed for inference such as embeddings, vocabularies and feature transformation configs, or even non-TensorFlow-based machine learning models.
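The "arbitrary versioned items with pass-through to their native APIs" part is easy to picture with a toy registry. Again this is a hypothetical sketch, not the real servable interface: `Servable`, `register` and `lookup` are invented here. The registry only knows a name and a version; the caller uses whatever API the underlying object already has.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Servable:
    """Hypothetical wrapper: a named, versioned handle to any object."""
    name: str
    version: int
    handle: Any  # the native object; the registry never inspects it


registry = {}


def register(servable):
    registry[(servable.name, servable.version)] = servable


def lookup(name, version):
    """Return the native object so the caller uses its own API directly."""
    return registry[(name, version)].handle


# A vocabulary (plain dict) and a non-TensorFlow "model" (plain function)
# are managed the same way.
vocab = {"hello": 0, "world": 1}


def linear_model(x):
    return 2.0 * x + 1.0


register(Servable("vocab", 1, vocab))
register(Servable("linear", 3, linear_model))

token_id = lookup("vocab", 1)["world"]   # dict API
prediction = lookup("linear", 3)(2.0)    # callable API
```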

More tutorials:
http://www.tensorflownews.com/2017/08/09/google-tensorflow-serving-library/