2026 企业级AI编程实践手册
https://lcnziv86vkx6.feishu.cn/wiki/XZOSwI51wi5a5okxCF4cAxHSnBh
https://lcnziv86vkx6.feishu.cn/wiki/XZOSwI51wi5a5okxCF4cAxHSnBh
outputs 结构,把它们统一还原成 semantic segmentation 的概率图 `(B, C, H, W)`,然后再交给 `RankSEG.predict(...)`。transformers.pipeline(...) patch 方案,至少第一步不这么做。outputs 对象动态挂方法,比如 outputs.rankseg()`,因为 outputs 类型不统一,很多模型还可能 `trust_remote_code=True 返回 tuple 或自定义对象。AutoModelForSemanticSegmentation.from_pretrained(...)processor = SegformerImageProcessor.from_pretrained(...)inputs = processor(images=image, return_tensors="pt")outputs = model(**inputs)outputs 之后、argmax 之前。outputs.logitsSemanticSegmenterOutputoutputs.class_queries_logitsoutputs.masks_queries_logitsoutputs.logitsoutputs.pred_maskstrust_remote_code=True 的模型可能不遵循标准 ModelOutput`,例如有人直接示例里写 `model(input_images)[-1]/Users/lev1s/Documents/GitHub/transformers/src/transformers/pipelines/image_segmentation.py/Users/lev1s/Documents/GitHub/transformers/src/transformers/modeling_outputs.py/Users/lev1s/Documents/GitHub/transformers/src/transformers/models/segformer/image_processing_segformer.py/Users/lev1s/Documents/GitHub/transformers/src/transformers/models/mask2former/image_processing_mask2former.py/Users/lev1s/Documents/GitHub/transformers/src/transformers/models/detr/modeling_detr.py/Users/lev1s/Documents/GitHub/transformers/src/transformers/models/oneformer/modeling_oneformer.pyrankseg/paddleseg 兼容层,思路是 rankseg 自己提供兼容 facade,而不是修改 PaddleSeg 上游。rankseg 仓库里设计并实现第一版 Hugging Face outputs 兼容函数,优先做最小可用版本,不要过度设计,不要做大而全封装,不要先做 pipeline monkey patch。semantic_probs_from_outputs(outputs, model=None, image_processor=None, target_sizes=None)predict_semantic_segmentation(outputs, model=None, image_processor=None, target_sizes=None, rankseg_kwargs=None)outputs.logitsoutputs.class_queries_logits + outputs.masks_queries_logitsoutputs.logits + outputs.pred_masks/Users/lev1s/Documents/GitHub/transformers/Users/lev1s/Documents/GitHub/ranksegrankseg repo 还没在本地,先确认路径或 clone 后再继续。
Loading...