H04N7/01—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level

H04N7/0112—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level one of the standards corresponding to a cinematograph film standard

Abstract

Translated from Chinese

检测具有一系列接连的场的影片内容的图象重复模式的方法和设备，该设备具有处理装置和存储器(M)，处理装置被安排成实行以下步骤：通过使用分段方法识别接连的场内的多个不同的对象，一个对象被规定为可以用单个运动模型描述的接连的场的图象部分；以及对于多个对象中的每一个对象实行以下步骤：建立接连的场内的每个对象的运动参量图案；把运动参量图案与多个预定的运动参量图案进行比较；以及通过使用前面步骤的结果来确定对于每个对象的图象重复模式。 Repeating pattern of the image of a method and apparatus for detecting video content having a series of successive fields, the apparatus having a processing unit and a memory (M), the processing means being arranged to carry out the steps of: by using a method for identifying segmentation successive field a plurality of different objects, an object can be defined as a series of single motion model describes the portion of the field of the image; and performing the following steps for each of a plurality of objects in an object: establishing for each object within the field of successive motion parameter pattern; the pattern of motion parameters with a plurality of predetermined pattern of motion parameters comparing; and determined by using the result of the previous step is repeated for each object model image.

Description

Translated from Chinese

识别并行地出现在单个电视信号场中的影片和视频对象 Identifying concurrently present in a single field of the television signal and movie video object

发明领域本发明涉及检测影片内容中的活动图象影片源的领域。 Field of the Invention The present invention relates to the field of moving picture film source is detected in the movie content.

现有技术在US-A-5,734,735中，描述了一种分析一系列视频图象的方法和系统。 In the prior art US-A-5,734,735, there is described a method and system for analyzing a series of video images.被使用来产生这些视频图象的产生媒体的类型被加以检测。 Is used to generate the generated video image of the media types are to be detected.每个系列的视频图象被分段成一系列单元，以便保持空间信息。 Each series of video images is segmented into a series of cells, in order to maintain spatial information.空间信息被使用来检测产生媒体的类型。 Spatial information is generated to detect the type of media used.然而，它没有揭示对于来自不同的源和被混合形成单个图象的、在一个图象内的不同的情景检测产生的类型的技术。 However, it does not disclose the type of technology for the different scenarios in a picture from different sources and mixed to form a single image is generated by detected.

US-A-6,014,182也涉及到用于检测活动图象影片源的方法。 US-A-6,014,182 also relates to a method for detecting a moving image of the video source.这样的检测在几种环境下是有用的，例如，行倍增器、电视标准变换器、电视慢运动处理和视频压缩。 Such detection is useful under several environments, e.g., line doubler, a television standards converter, a television slow motion processing and video compression.例如，60Hz NTSC电视信号具有24帧/秒的活动图象影片作为它的源。 For example, 60Hz NTSC television signal having moving picture video 24 frames / second as its source.在这样的方案中，使用3-2下拉比(3-2pull down ratio)，即，三个视频场来自一个影片帧，而下两个视频场来自下一个影片帧，等等。 In such a scheme, using 3-2 pulldown ratio (3-2pull down ratio), i.e., three video fields from one film frame, and the next two video fields come from the next film frame, and so on.例如，把相继的视频场称为A、B、C、D、E，3-2下拉比看起来就像AAABBCCCDDEEE。 For example, the successive video fields is called A, B, C, D, E, 3-2 pull-down than it looks like AAABBCCCDDEEE.其他的源具有2-2下拉比或与视频摄影机有关，正如本领域技术人员已知的那样。 Other sources having a ratio of 2-2 pull down or a video camera which, as known to the person skilled in that.因此，对接连的场进行比较就可以产生有关所使用的活动图象源的信息。 Therefore, comparing successive field can generate information about the moving images of the source used.

US-A-5,365,280提出对于不同的场使用不同的运动矢量，以及产生一种图象信号处理模式控制信号，它可被电视接收机作为关于该场是涉及到电影片还是非电影片的指示来使用。 US-A-5,365,280 proposes using different fields for different motion vectors, and generating an image signal processing mode control signals, which may be a television receiver as an indication of the field in relation to the film or sheet to a non-film sheet use.

运动估值算法可以在M.Tekalp的“Digital VideoProcessing(数字视频处理)”，Prentice Hall，ISBN 0-13-190075-7中找到。 Motion estimation algorithm, Prentice Hall, found in 0-13-190075-7 M.Tekalp in the "Digital VideoProcessing (digital video processing)" ISBN.基于对象的运动估值方法的概貌由Paolo Vicari给出：“Representation and regularization of motion fields withregion-based models(基于区域的模型的运动的场的代表和正规化)”对于Politecnico di Milano的论文，No.598034。 Motion estimation method based on the profile of the object is given by Paolo Vicari: "Representation and regularization of motion fields withregion-based models (based on motion field model representing the region and normalized)" For Politecnico di Milano paper, No .598034.

发明概要至今为止，现有技术集中在检测具有起源于单个活动图象源的场的影片或具有起源于两个或多个不同的活动图象源的相继场的影片的活动图象源。 SUMMARY OF THE INVENTION To date, the prior art focused on the detection of active video image source having a field originating from a single source or a moving image having a field sequential video originated in two or more different moving picture source.然而，数量不断增加的影片包括起源于不同的活动图象源的场内的图象的混合。 However, a growing number of films including mixed picture originated in different activities within the field of the source image.以上讨论的现有技术方法都不能检测影片的场内的各个不同图象的图象重复模式。 The method of the prior art discussed above video images can not be detected repeating pattern of the various images of the field.然而，例如，在图象速率变换的应用中，需要知道在场内的各个不同图象的起源的指示。 However, for example, the application rate of the image transformation, it is necessary to know the origin of the various images indicative of the field.更具体地，必须知道该视频是否来源于影片内容，以便最佳地执行去交织和影片震颤去除。 More specifically, it is necessary to know whether the video from video content in order to best perform de-interleaving and videos tremor removed.

所以，本发明的目的是提供允许检测在场中的各个不同对象的图象重复模式的设备和方法。 Therefore, an object of the present invention to provide an apparatus and method of the image repetition patterns for detecting the presence of the various objects of allowed.在这方面，“对象”可以是一个场的一个单独的图象的一部分。 In this regard, "subject" may be part of a single image of a field.“对象”被定义为这样一个图象部分，它可以用单个运动模型来描述。 "Object" is defined as a part of the image, which can be described by a single motion model.这样的“对象”不一定必须包括一个“物理的”对象，例如一个人的图象。 Such a "subject" does not necessarily include a "physical" object, such as a person's image.对象可以很好地涉及到一个以上的物理对象，例如，乘坐在自行车上的人，其中人和自行车的运动基本上可以用同一个运动模型来描述。 The objects are well involve more than one physical object, for example, riding on a bicycle, the bicycle and the motion of which can be described with substantially the same motion model.另一方面，可以安全地假设，这样规定的对象属于起源于一个单个影片源的一个单个图象。 On the other hand, it can be safely assumed that the object belongs to a single such predetermined image originated from a single video source.

为了达到本发明的目的，提供了用于检测包括一系列接连的场的影片内容的图象重复模式的方法，该方法包括以下步骤：-建立影片内容的运动参量图案；-把图案与多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定图象重复模式；其特征在于，方法包括以下步骤：·识别在接连的场内的多个不同的对象，一人对象被定义为可以用单个运动模式描述的接连的场的图象部分；·实行以下步骤：-建立接连的场内的每个对象的的运动参量图案；-把运动参量图案与多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定每个对象的图象重复模式。 To achieve the object of the present invention, there is provided a method for detecting an image repeating pattern comprising a series of successive fields of video content, the method comprising the steps of: - establishing a motion video content pattern parameter; - and the plurality of pattern a predetermined pattern of motion parameters for comparison; - determining a repetitive pattern image by using the result of the previous steps; characterized in that, the method comprising the steps of: identifying a plurality of successive field of different objects, one object is defined as successive image portion of the field can be described with a single motion pattern; • implementation of the following steps: - establishing for each successive objects within the field of motion parameters of the pattern; - the pattern of motion parameters with a plurality of predetermined pattern of motion parameters comparison; - determining the image of each object using the result of repeated pattern through the previous steps.

因此，按照本发明，在检测影片模式之前，电视信号的场通过分段技术被分离成不同的对象。 Thus, according to the present invention, prior to detecting the film mode field of the television signal is separated into different objects by segmentation techniques.任何可以达到这个目的已知技术都可以被使用。 This can be achieved in any known techniques can be used.然后，检测每个不同对象的影片模式。 Then, each film mode detection of different objects.任何已知的影片模式检测技术可被使用于这个目的。 Any known film mode detection techniques may be used for this purpose.

优选地，也使用运动参量估值技术。 Preferably, also using the motion parameter estimation techniques.

至今为止，正如本发明人知道的，还没有人尝试使用运动参量估值技术来识别起源于不同的源由于混合而形成的不同的图象部分(对象)。 So far as the present inventors know, it has not been attempted using the motion parameter estimation technique to identify the origin of different portions of the image (object) due to mixing of different sources formed.

本发明也涉及检测包括一系列接连的场的影片内容的图象重复模式的设备，该设备包括处理装置和存储器，处理装置被安排成可执行以下步骤：-建立影片内容的运动参量图案；-把图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定图象重复模式；其特征在于，处理装置被安排成可执行以下步骤：·识别在接连的场内的多个不同的对象，一个对象被定义为可以用单个运动模式描述的接连的场的一个图象部分；·执行以下步骤：-建立接连的场内的每个对象的的运动参量图案；-把运动参量图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定每个对象的图象重复模式。 The present invention also relates to an image detection device comprising a repeating pattern of video content a series of consecutive fields, the apparatus comprising processing means and a memory, the processing means is arranged to perform the following steps: - establishing a motion video content pattern parameter; - the pattern is stored in the memory a plurality of predetermined pattern comparing motion parameters; - determining a repetitive pattern image by using the result of the previous step; wherein the processing means is arranged to perform the following steps: identifying a plurality of different successive field object, an object is defined as a portion of the field of the succeeding picture may be described by a single motion pattern; * performing the following steps: - establishing for each object within the field of motion succeeding pattern parameter; - the pattern of motion parameters stored in the memory a plurality of predetermined pattern comparing motion parameters; - by using the result of the previous step to determine the repetitive pattern of each object image.

这样的设备可以有利地被实施在芯片上。 Such a device may be advantageously implemented on a chip.包括这样的芯片的电视机以及芯片本身也是本发明要求得到专利权保护的。 Such TV chips including the chip itself and also claim the present invention obtained patent protection.

本发明也涉及被计算机设备装载的计算机程序产品，包括用于检测其中包含了一系列接连的场的影片内容的图象重复模式的指令，该设备包括处理装置和存储器，计算机程序产品在被装载后可向处理装置提供执行以下步骤的能力：-建立影片内容的运动参量图案；-把图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定图象重复模式；其特征在于，处理装置被安排成可执行以下步骤：·识别在接连的场内的多个不同的对象，一个对象被定义为可以用单个运动模式描述的接连的场的一个图象部分；·执行以下步骤：-建立接连的场内的每个对象的运动参量图案；-把运动参量图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定每个图案的图象重复模式。 The present invention also relates to a computer program product is loaded in the computer apparatus, comprising means for detecting a series of instructions which contains the image content of the repeating pattern of successive video fields, the apparatus comprising processing means and a memory, the computer program product is loaded after provides the ability to perform the steps of the processing apparatus to: - create a video content movement quantity pattern; - the pattern of the plurality stored in the memory in a predetermined movement parameter pattern comparison; - by using the previous step results to determine repeating image patterns; wherein the processing means is arranged to perform the following steps: identifying a plurality of different successive field object, an object is defined as a successive field can be described by a single motion pattern image portion; * performing the following steps: - establishing for each successive objects within the field of motion parameters of the pattern; - the pattern of motion parameters stored in the memory a plurality of predetermined pattern comparing motion parameters; - using the previous the step of determining the result of the image pattern of each repetitive pattern.

这样的设备可以有利地被实施在芯片上。 Such a device may be advantageously implemented on a chip.包括这样的芯片的电视机以及芯片本身也是本发明要求得到专利权保护的。 Such TV chips including the chip itself and also claim the present invention obtained patent protection.

附图简述现在参照一些附图说明本发明，这些附图只打算用来说明本发明而不是限制它的范围。 BRIEF DESCRIPTION Referring now to the present invention, some of the drawings, these figures only intended to illustrate the invention and not to limit its scope.该范围只由所附上的权利要求限制。 Solely by the scope of the appended claims limit the claim.

图1显示多参量估值器和分段设备的方框图；图2A，2B，2C，2D显示电视屏幕图，这些图说明选择感兴趣的点的处理过程，在这些点上参量估值器使得它们的参量最佳化；图3A，3B，3C，3D显示电视屏幕图，这些图说明分段的处理过程。 1 shows a block diagram of a multi-segment and parameter estimation device; FIG. 2A, 2B, 2C, 2D display television screen diagrams that illustrate the processing of the selected point of interest, at these points so that they parametric estimator the parameter optimization; FIG. 3A, 3B, 3C, 3D TV display screen of FIG, which illustrate the processing segment.

优选实施例详细描述引言下面，提出一种检测在一幅情景图象上各个对象的影片模式的方法。 Detailed Description Introduction The following preferred embodiments, a method of film mode of each object in an image scene detected.为此，首先描述一种识别在一幅情景图象上各个对象的方法。 For this purpose, a method is described in the first scene image on an identification of each object.各个对象是通过运动估值被识别的，即，可以利用同一个运动模式来描述的情景中的那些部分被识别为属于该情景中同一个对象。 Each object is identified by motion estimation, i.e., those portions of the scene can use the same movement pattern will be described in the same object is identified as belonging to this scenario.运动估值器正如从现有技术(例如从[1]，[3]，[4]，[5]，和[6])中知道的那样。 As the motion estimator from the prior art (e.g. from [1], [3], [4], [5] and [6]) as in known.在这些参考文献中，[1]描述了一种运动估值器，它允许识别在情景中的对象，而不需要应用图象分段。 In these references, the [1] describes a motion estimator, which allows the identification of objects in the scene, without the need to apply the image segment.

对于本发明，运动估值器最好是被设计成适合于图象速率变换，它具有的计算复杂性适合于消费电子应用，即，可以与[5，6]相比较的。 For the present invention, the motion estimator is preferably designed to be suitable for picture rate conversion, which has computational complexity suitable for consumer applications, i.e., it may be [5,6] compared.

早先在[1]中描述的对象运动估值器的最显著的特性是在估值模型参量之前没有在把图象分段成对象方面花费力气，正像在其他的现有技术对象运动估值器中那样。 The most significant properties of the object motion estimator described earlier in [1] is not to take the effort in terms of the image is segmented into objects, just as in the other prior art estimates the object motion model parameters before valuation It is in that.基本上，选择相对较小数目的感兴趣的图象部分，以及多个并行运动模型参量估值器试图对这个数据组执行它们的参量的最佳化。 Basically, the relatively small number of selected image portion of interest, and a plurality of parallel motion model parameter estimation tries to perform their optimization parameters of this data set.只要一个估值器在一定的数目的感兴趣的图象部分上比另一个估值器更成功，它就集中在这些部分，而其余估值器集中在其他部分。 As long as a ratio estimator further estimates at a certain number of the image portion of interest of the more successful, it is concentrated in these portions, the remaining portions of the other estimator concentrated.简言之，各个估值器试图打破各个图象部分，把总的图象划分成各个“对象”。 Briefly, each of the estimators attempt to break each part of the image, the overall image into individual "objects."这种现有技术对象运动估值器允许实时的基于对象的运动估值，以及可以有利地使用于本发明的影片检测技术。 This prior art allows real object motion estimator based on the valuation of the moving object, and the film can be advantageously used in the detection technique of the present invention.

基本上，这样的基于对象的运动估值器，没有浪费精力在昂贵的图象分段上，它在运行方面应当能够与基于块的运动估值器竞争，因为人们预期在现实的图象中对象应当比块的数量少。 Basically, this object based motion estimator, without wasting energy on expensive image segment, it should be able to compete based motion estimator block in the operation, since they are expected in the real image objects should be less than the number of blocks.仅仅是在把图象部分分配给对象方面，所需要的努力是与基于块的候选矢量的估值可比较的。 Only in the portion assigned to the object image, and efforts are required to estimate candidate vectors block-based comparable.如果对象的数目没有超过候选矢量数目太多，则基于对象的运动估值器的附加开销应当是可忽略的。 If the number of objects does not exceed the number of candidate vectors too, the additional overhead object based motion estimator should be negligible.这里假设，每个对象的运动可以用相当简单的参量模型来描述。 It is assumed that the motion of each object can be relatively simple to describe the model parameters.

在以下的段落中，我们将描述所使用的优选的运动模型、运动模型参量的估值、所使用的优选的费用函数、分段处理过程、和一个情景内各个对象的影片模型检测。 In the following paragraphs we will use the description of a preferred motion model, the motion estimates model parameters, used is preferably the cost function, segmentation process, each object model checking, and videos in a scene.

运动模型为了减小复杂性，每个对象o的运动用简单的一阶线性模型来描述，一阶线性模型只能描述变换和缩放。 To reduce the complexity of the motion model, the motion of each object o with a simple first order linear model to describe the first-order linear model can only describe the transformation and scaling.更复杂的参量运动模型对于本领域技术人员是已知的，例如，各种包括旋转的模型，它们确实可以与所建议的算法相组合地应用，但这里将不考虑。 More complex parametric motion models of ordinary skill in the art are known, e.g., including various models of rotation, they can indeed be applied in combination with the proposed algorithm, but will not be considered here.因为我们将引入一种可使得这样的复杂的模型过时的改进。 Because we will make an introduction of such complex models obsolete improvement.

参量估值给定运动模型后，接着需要把它的参量对于图象中的给定的对象进行最佳化。 Parameter estimation after a given motion model, and then it needs to be optimized for the parameters given in the object image.由于在几乎每个情景中出现静止图象部分，我们假设存在一个对象o，o＞0'，它的运动用 Since the still picture portion occurs in almost every scenario, we assume the existence of an object o, o> 0 ', with its movement(零矢量)来描述。 (Zero vector) will be described.显然，为了使得这是可提供的，并不需要进行估值。 Obviously, this is in order to make available, it does not need to be valued.附加的对象o，o＞0，的参量矢量由如图1所示的、它们的各个参量估值器(PEm，m＝1，2，...，M)被并行地分开地估值。 , Their respective parameter estimator (PEm, m = 1,2, ..., M) are valued separately additional objects o, o> 0, the vector of parameters as shown by FIG 1 in parallel.

图1显示具有多个被并行地连接到数据减缩单元DRU的输出端的参量估值器PEm(n)的设备的方框图。 1 shows a block diagram of a plurality of parallel data reduction unit is connected to the output of the DRU parameter estimator PEm (n) having a device.数据减缩单元DRU被安排来选择一组感兴趣的、被用于进行计算的图象象素。 DRU data reduction unit is arranged to select a set of interest is calculated for the image pixel.施加到数据减缩单元DRU的输入是在时间n的图象和在时间n-1的所述图象。 Applied to the data input unit DRU is a reduction in the time n and the video image at time n-1,.PEm(n)的每个输出被连接到分段单元SU。 PEm (n) for each segment is connected to the output unit SU.

分段单元SU的输出被反馈到参量估值器PEm(n)，因为它们优选地一起执行递归运行，正如下面将说明的那样。 Segment output is fed back to unit SU parametric estimator PEm (n-), because they are preferably performed with a recursive operation, as will be explained below.分段处理过程的最终结果由一个情景的一些象素组形成，每个象素组属于不同的对象以及把它分配给不同的运动矢量。 The final result of segmentation process is formed by a number of pixels in a group scenario, each of the pixel groups belonging to different objects, and assign it to a different motion vectors.这些输出数据被提供给处理单元PU，该单元被安排来检测每个对象的影片源的类型，并且对不同的对象执行预定的任务，诸如图象速率变换。 The output data is supplied to the processing unit PU, the cell is arranged to detect the type of video source each object, and performing a predetermined task for different objects, such as picture rate conversion.处理单元PU被连接到存储器M，后者存储被使用来检测影片源的类型的、预定的运动参量图案，正如在下面说明的那样。 The processing unit PU is connected to a memory M, which stores is used to detect the type of video source, a predetermined pattern of motion parameters, as described below As.存储器可以是任何已知的类型，即，RAM、ROM、EEPROM、硬盘等等。 The memory may be of any known type, i.e., RAM, ROM, EEPROM, hard disk and the like.处理单元PU的输出用于例如控制电视屏幕。 Output processing unit PU for controlling the television screen, for example.

可以看到，数据减缩单元DRU、参量估值器PEm(n)、分段单元SU和处理单元PU被显示为分开的模块。 Can be seen, data reduction unit the DRU, PEm parameter estimator (n-), segmentation unit SU and the processing unit PU are shown as separate modules.这些模块可被实施为具有不同的处理器和存储器的、分开的智能单元。 These modules may be implemented as separate units having different intelligent processor and a memory.然而，正如本领域技术人员将会看到的，这些单元可被集成为单个单元，诸如包括处理器和装载有适用的软件的适用的存储器的通用微处理器。 However, as those skilled in the art will be seen, these units may be integrated into a single unit, including a general purpose microprocessor such as a processor and suitable software is loaded with a suitable memory.这样的微处理器未显示出，但可以从任何计算机手册中知道。 Such a microprocessor not shown, but may know from any computer manuals.替换地，图1所示的设备可以以硬连线的逻辑单元的形式来实施，正如本领域技术人员知道的那样。 Alternatively, the device shown in FIG. 1 may be implemented in the form of hard-wired logic unit, as those skilled in the art know it.优选地，图1所示的整个设备作为单个芯片被封装在单个组件中。 Preferably, the entire apparatus shown in FIG. 1 as a single chip is encapsulated in a single component.这样的单个芯片组件可以容易地被包括在电视设备中。 Such single chip assembly can be easily included in a television apparatus.

每个PEm(n)更新先前估值的参量矢量，然后最佳的参量候选矢量按照费用函数被选择为该对象的结果参量矢量。 Each PEm (n) to update the previous parameter vector estimates, then the best candidate vector parameter is selected according to cost function results vector for the object parameters.考虑公式(1)的四参量模型，对象o，o＞0，的参量被看作为参量矢量 Consider equation (1) four-parameter model, the object o, o> 0, the parameter is regarded as a variable vectorP&RightArrow;o(n)=sx(o,n)sy(o,n)dx(o,n)dy(o,n)---(2)]]>我们规定我们的任务是从多个候选参量矢量 P & RightArrow; o (n) = sx (o, n) sy (o, n) dx (o, n) dy (o, n) --- (2)]]> we set our task from a plurality of candidate parameter vector中选择 Select作为以后我们将说明的具有费用函数的最小数值的参量矢量。 Having a minimum value of the cost function will be described later in the parameter vector.

优选地，候选者是以非常类似于在[5，6]中利用的策略被产生的，即，取预测矢量，添加至少一个更新矢量，和按照误差准则选择最佳的候选的参量矢量。 Preferably, the candidate is very similar to the use of [5,6] of the policy is generated, i.e., taking the predicted vector, adding at least one update vector, and selecting the best candidate parameter vector in accordance with an error criterion.候选参量组CSo(n)按照以下公式包含三个候选者 Candidate set of parameters CSo (n) comprises three candidates according to the equationCSo(n)={C&RightArrow;o(n)|C&RightArrow;o(n)=P&RightArrow;o(n-1)+mU&RightArrow;o(n),U&RightArrow;o(n)&Element;USo(n),m=-1,0,1}---(3)]]>其中更新参量 CSo (n) = {C & RightArrow; o (n) | C & RightArrow; o (n) = P & RightArrow; o (n-1) + mU & RightArrow; o (n), U & RightArrow; o (n) & Element; USo (n), m --- = -1, 0} (3)]]> wherein updating parameters是从更新参量组USo(n)中选择的：USo(n)={i000,0i00,00i0,000i}---(4)]]>(i＝1，2，4，8，16)费用函数给定运动模型和某些候选参量组后，我们需要按照费用函数选择最佳的候选者作为给定的对象的结果。 Is selected from the group parameter update USo (n) in: USo (n) = {i000,0i00,00i0,000i} --- (4)]]> (i = 1,2,4,8,16) Cost after the function of a given motion model parameters and certain candidate group, we need to choose the best candidate as a result of a given object in accordance with the cost function.费用函数可以是在来自相邻的图象的运动补偿的象素之间的绝对差值的和值，并且矢量利用(候选)矢量模型来产生。 And the cost may be a function of the absolute differences between pixels from adjacent image motion compensation, and the use of the vector (candidate) to generate the vector model.然而，我们需要知道运动模型将要被分配到的区域。 However, we need to know the area motion model to be allocated.两个问题，(即分段和运动估值)是互相依赖的。 Two problems, (i.e. segmentation and motion estimation) are mutually dependent.为了正确地估计在一个对象中的运动，应当知道对象以及反之亦然。 To move an object to correctly estimate, you should know the object and vice versa.

作为运动估值处理过程的第一步，我们规定感兴趣的象素块为一个组，它们形成“感兴趣的”图象部分的组SI(n)，它们将被用作为使得所有的参量模型最佳化的基础。 As a first step in the motion estimation process, we are interested in a predetermined pixel block as one group, they form a group SI (n) of the image portion "of interest", as they are used such that all the model parameters best of the foundation.

现在，各个参量估值器的焦点必须是在不同的对象上。 Now, each parameter estimator focus must be on different objects.为此，每个参量估值器PEm(n)将对于在组SI中规定的同一组感兴趣的位置计算它的费用函数，对不同的位置给出不同的加权因子 To this end, each parameter estimator PEm (n) to calculate its cost function to a predetermined location on the same set of interest in the group of SI, given different weighting factors for different positions。 .这里， Here,与象素块的位置有关。 Related to the position of the pixel block.建议的算法是直截了当的：·按照先前的图象分段步骤，在SI(n)中的象素属于同一个对象，(即，同一个参量估值器)的情形下，象素数值与大于1的第一加权因子(例如，8)相乘。 The proposed algorithm is straightforward: · segmentation step according to the previous picture, the pixels in the SI (n) belong to the same object (i.e., the same parameter estimators) of the case, the pixel values ​​is greater than a first weighting factor (e.g., 8) is multiplied.

·在分段把位置分配给另一个参量估值器以及这个估值器达到低的匹配误差的情形下，象素数值与小于1的第一加权因子(例如，0.1)相乘。 · In the case of the segment to a dispensing position to another parameter estimator and the estimator reaches a low matching error, the pixel values ​​of the first weighting factor (e.g., 0.1) of less than 1 is multiplied.

图2给出在具有单个运动对象(即，骑自行车者)和运动背景的图象中选择感兴趣的象素块的例子。 Figure 2 shows a moving object having a single (i.e., cyclists) Examples pixel block of interest and moving background image selected.这个选择是由数据减缩单元DRU实行的。 This selection is implemented data reduction unit DRU.因此，数据减缩单元呈现出一组最感兴趣的象素单元(SI)，导致相当便宜(很少的计算)和有效的参量估值。 Thus, data reduction unit presents a set of pixel elements of most interest (the SI), resulting in relatively inexpensive (less calculation) and an effective amount parameter estimates.图2显示的屏幕图说明选择感兴趣的点的处理过程，在这些点上参量估值器可以使得它们的参量最佳化。 Figure 2 shows the screen of FIG selection process is described point of interest, at these points so that the parameter estimation can be optimized parameters thereof.在两个接连的图象之间的时间差值图象不是被实际计算的，但它可被用来理解为什么矢量 In the time difference between the image of the two successive images are not actually calculated, but it can be used to understand why the vector(即，具有感兴趣的点的整个组)的高的匹配误差是在图3C所示的位置上。 (I.e., having the whole set of points of interest) high match error is the position shown on FIG. 3C.在图3D上，显示了本例中两个参量估值器的注意力如何被分开在这些感兴趣的点。 In 3D, the present example shows how two variables estimator attention is separated these points of interest.即，图3D显示：有两个不同的运动模型被检测到。 That is, FIG. 3D show: There are two different motion models are detected.两个子组以不同的亮度被显示，即一个是黑色，另一个是灰色。 Two sub-groups are displayed in different brightness, i.e. one is black and the other gray.

图象的运动背景是对象o＝1，骑自行车者是对象o＝2。 Moving the background image is the object o = 1, the cyclist is the object o = 2.有两个参量估值器，它们都对包含感兴趣的块的同一个组进行最佳化，但只要一个估值器在分段时被选择为区域中最佳的，则在该区域中感兴趣的象素块在费用函数上就是突出的。 There are two parameters estimator, which are optimized for the same block of interest comprise, but as long as an estimate is selected when the best segment region, the sense of the region in pixel blocks of interest in the cost function is outstanding.然后，这将收敛到所显示的情形，在其中一个估值器聚焦在灰色块，另一个估值器聚焦在SI(n)中的白色象素块。 Then, it will converge to the case shown, in which a gray blocks estimator focused, the focus in the other estimator SI (n) is a white pixel block.

二次采样有效地减小所需要的存储器带宽。 Subsampling effectively reduce the memory bandwidth required.图象以一个场为基础用等于4的因子被水平地二次采样和用等于2的因子被垂直地二次采样，从而从每个原先的场F(n)中产生二次采样的图象Fs(n)。 In a video field basis by a factor equal to 4 is horizontally subsampled by a factor equal to 2 and is vertically sub-sampled to produce sub-sampled from each of the previous field F (n) of the image Fs (n).为了在F的原先的象素栅格上达到象素精度，需要在二次采样的栅格上进行内插。 For F in the pixel grid to reach the original pixel precision interpolation is required in the subsampling grid.

我们采用这样一种递归分段方法，它非常近似于3-DRS块匹配器的策略，例如，如在[5]中揭示的，即，使用最佳PEm(n)的空间和时间预测，以及惩罚对于在空间-时间邻居中没有出现的PEm(n)的选择。 We employed a recursive segmentation method that is very similar to policy block matcher 3-DRS, e.g., as disclosed in [5], i.e., using the best PEm (n) spatial and temporal prediction, and for punishment in space - select PEm (n) time does not appear in the neighbor's.正式地，分段掩膜片 Formally, segmented mask sheet把具有最低的局部修改的费用函数 The cost function has the lowest local modification of的对象o分配给块 Blocks assigned to the object o，其中&epsiv;o&prime;=&epsiv;o+P(X&RightArrow;,n)]]>而 Wherein & epsiv; o & prime; = & epsiv; o + P (X & RightArrow;, n)]]> and是按照以下的法则选择的惩罚因子： Is selected according to the following rules to punish factors:和&delta;&RightArrow;=ij,i,j=0,&PlusMinus;1---(9)]]>类似于对于3-D RS块匹配器所建议的内容[5]，Pu是最大惩罚因子，Pt只是小的惩罚因子，虽然不存在为什么Ps不能正好是零的理由。 And & delta; & RightArrow; = ij, i, j = 0, & PlusMinus; 1 --- (9)]]> similar to the contents of 3-D RS block matcher suggested [5], Pu is the maximum penalty factor, Pt only a small penalty factor, although there is no reason why the Ps is not exactly zero.一个相当明显的简化是把 A fairly obvious is to simplify固定到与扫描方向相反的方向，以及逐场地交替扫描。 Fixed to the opposite scanning direction, and alternately by field scan.图3A-3D给出按照基于对象的运动估值方法的分段的例子，其中具有原先的亮度的图象。 Figures 3A-3D are given according to the example segment motion estimation method based on the object, wherein the original image has a luminance.图3A-3D显示从电视屏幕上拍摄的并且可以说明分段处理过程的照片。 Figures 3A-3D show taken from the television screen and may be segmented photograph illustrating a processing procedure.图3A显示原先的图象，而图3B-3D显示接连的分段结果。 3A shows the original image, and FIG. 3B-3D show the results of successive segments.显然，图3A上的第一图象具有很差的、几乎是随机的分段，然而，各个估值器在分段时集中在它们的区域，从而快速地收敛到有用的分段：图3D显示可以区分两个不同的对象，一个涉及到骑自行车者以及一个涉及到背景。 Clearly, on the first image having poor FIG. 3A, almost random segments, however, each segment estimator when concentrated in a region thereof, thereby rapidly converge to a useful segment: FIG. 3D We can distinguish between two different display objects related to a cyclist, and relates to a background.

现有技术影片模式识别与对于图象中的每个对象的运动矢量的计算不同，在图象速率变换中的应用需要图象系列的原点的表示。 Prior art film pattern recognition and image motion vector is calculated for each object in different origin applications require images in the image series showing the rate of conversion.更具体地，必须知道该视频是否起源于影片内容，以便最佳地执行去交织和影片震颤去除。 More specifically, it is necessary to know whether the video originated in the video content, in order to optimally perform de-interleaving and videos tremor removed.而且，必须在2-2下拉图象内容、2-3下拉图象内容、和来自视频摄影机的视频之间进行区分。 Further, the image content must be in the 2-2 pull-down, 2-3 pull-down video content, and to distinguish between the video from a video camera.

正如这里先前讨论的，在现有技术方法中，这个检测关系到全局判决，即，在视频摄影机与各个影片格式之间的鉴别只对于整个图象来进行。 As previously discussed herein, in the prior art methods, the relationship between the detected overall decision, i.e., discrimination between the video camera and the respective video format is only performed for the entire image.

由于[8]的适配性，对于基于对象的运动估值器，可靠的电影检测器可以通过分析只由覆盖最大的图象区域的参量估值器描述的运动而实现，它明显地不考虑零矢量“估值器”。 [8] the suitability, for object-based motion estimator, reliable film detector may be achieved by analyzing only the movement described by the maximum image area covered parametric estimators because it apparently does not consider zero vector "estimator."

让我们把max(n)定义为参量矢量Po(n)的最大分量(在此，不取参量矢量的最大分量，同样有可能地使用其中任一个或所有两个参量分量的平均值、绝对值、或相加的绝对值)，即，max(n)＝max{sx(o，n)，sy(o，n)，dx(o，n)，dy(o，n)} (10)我们现在把最新的历史组RH(n)组合为：RH(n)＝{max(n)，max(n-1)，max(n-2)，max(n-3)，max(n-4)，max(n-5)，max(n-6)}(11)它可以通过取适合的门限值而被变换成二进制电影检测组MD(n)，对于2-2下拉，将给出如下的结果：MD(n)＝{0，1，0，1，0，1，0}， (12)对于2-3下拉给出如下的结果：MD(n)＝{0，1，0，0，1，0，1}， (13)以及对于视频给出如下的结果：MD(n)＝{1，1，1，1，1，1，1}， (14)把该实际的组与被存储在存储器M中的有限数目的已知图案进行比较，从而产生关于电影类型和阶段的信息。 Let max (n) is defined as a parameter vector Po (n) of the maximum component (here, do not take maximum component parameter vector is likewise possible to use the average of either one or both of parameters of components, the absolute value , or the sum of absolute value), i.e., max (n) = max {sx (o, n), sy (o, n), dx (o, n), dy (o, n)} (10) we now the latest history group RH (n) a combination of: RH (n) = {max (n), max (n-1), max (n-2), max (n-3), max (n-4 ), max (n-5), max (n-6)} (11) which can be transformed by taking the threshold value for detecting the film into binary group MD (n), for the 2-2 pull-down, will be given the following results: MD (n) = {0,1,0,1,0,1,0}, (12) given below for 2-3 pulldown results: MD (n) = {0,1,0 , 0,1,0,1}, (13) given below and the results for the video: MD (n) = {1,1,1,1,1,1,1}, (14) to the actual group compared with the limited number of known patterns are stored in the memory M to generate information on the type and stage of the film.在景象切割的情形下，检测器产生输出不可靠，这表示运动补偿最好是被加以关断。 In the case of a scene cut, the detector produces an unreliable output, which represents the best motion compensation is to be turned off.

按照本发明的影片模式识别本发明涉及一种检测在情景中各个对象的影片模式的方法。 It relates to a method of detecting film mode of each object in the scene in the film according to the present invention, a method of the present invention the pattern recognition.来自不同的源的越来越多的图象在产生期间被混合。 More and more images from different sources are mixed during production.所以，我们建议采用基于对象的运动估值器，这样，它连同在情景中的对象的运动参量估值一起，决定它们的原点。 Therefore, we recommend the use of object-based motion estimator, so that together with the parameters of a moving object in the valuation scenario together, determine their origin.

为此，我们分析由所有的各个参量估值器描述的运动。 To this end, we analyze the motion is described by all the various parameters estimators.

让我们把maxo(n)定义为参量矢量Po(n)的最大分量(在此，不取参量矢量的最大分量，同样有可能地使用其中任一个或所有两个参量分量的平均值、绝对值、或相加的绝对值)，即，maxo(n)＝max{sx(o，n)，sy(o，n)，dx(o，n)，dy(o，n)}. (15)我们现在把最新的历史组RHo(n)组合为：RHo(n)＝{maxo(n)，maxo(n-1)，maxo(n-2)，maxo(n-3)，maxo(n-4)，maxo(n-5)，maxo(n-6) (16)它可以通过取适合的门限值而被变换成二进制电影检测组MDo(n)，对于2-2下拉，将给出如下的结果：MDo(n)＝{0，1，0，1，0，1，0}， (17)对于2-3下拉给出如下的结果：MDo(n)＝{0，1，0，0，1，0，1}， (18)以及对于视频给出如下的结果：MDo(n)＝{1，1，1，1，1，1，1}， (19)把实际的组与被存储在存储器M中的有限数目的已知图案进行比较，可以产生对 Let maxo (n) is defined as a parameter vector Po (n) of the maximum component (here, do not take maximum component parameter vector is likewise possible to use the average of either one or both of parameters of components, the absolute value , or the sum of absolute value), i.e., maxo (n) = max {sx (o, n), sy (o, n), dx (o, n), dy (o, n)}. (15) we now bring the latest history group RHo (n) is a combination of: RHo (n) = {maxo (n), maxo (n-1), maxo (n-2), maxo (n-3), maxo (n- 4), maxo (n-5), maxo (n-6) (16) which can be transformed by taking the threshold value for detecting the film into binary group MDo (n), for the 2-2 pull-down, will be given the following results: MDo (n) = {0,1,0,1,0,1,0}, (17) given below for 2-3 pulldown results: MDo (n) = {0,1,0 , 0,1,0,1}, (18) given below and the results for the video: MDo (n) = {1,1,1,1,1,1,1}, (19) the actual group is compared with the limited number stored in the memory M of a known pattern, may be generated for每个各个对象的关于电影类型和阶段的信息。 Information about each individual object type and stage of the film.在景象切割的情形下，检测器产生不可靠的输出，这表示对于所有的对象，运动补偿最好是被加以关断。 In the case of a scene cut detector unreliable output, which means that for all objects, the motion compensation is preferably to be switched off.

[7]G.de Haan和PWACBiezen，“Time-recursive de-interlacing for high-quality television receivers(用于高质量电视机的时间递归去交织)”，Proc.Of the Int.Workshop on HDTVand the Evolution of Television，Taipei，Taiwan，November 1995，pp.8B25-8B33. [7] G.de Haan and PWACBiezen, "Time-recursive de-interlacing for high-quality television receivers (TV's recursion for time deinterleaving quality)", Proc.Of the Int.Workshop on HDTVand the Evolution of Television, Taipei, Taiwan, November 1995, pp.8B25-8B33.

[8]G.de Haan，H.Huijgen，PWACBiezen和OAOjo，“Method and appaatus for discriminating between movie film andnon-movie film and generating a picture signal processing modecontrol signal(用于鉴别电影影片和非电影影片以及产生图象信号处理模式控制信号的方法和设备)”，United States PatentNo.US5,365,280，November 15，1994。 [8] G.de Haan, H.Huijgen, PWACBiezen and OAOjo, "Method and appaatus for discriminating between movie film andnon-movie film and generating a picture signal processing modecontrol signal (for authentication and non-film film film film and generating FIG. method and apparatus for image signal processing mode control signals) ", United States PatentNo.US5,365,280, November 15,1994.

Claims (13)

Translated from Chinese

1.一种用于检测包括一系列接连的场的影片内容的图象重复模式的方法，该方法包括以下步骤：-建立影片内容的运动参量图案；-把图案与多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定图象重复模式；其特征在于，该方法包括以下步骤：·识别在接连的场内的多个不同的对象，一个对象被定义为可以用单个运动模式描述的接连的场的一个图象部分；·实行以下步骤：-建立接连的场内的每个对象的的运动参量图案；-把运动参量图案与多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定每个对象的图象重复模式。 1. A method of detecting the image pattern comprises a repeating series of successive video content of a field, the method comprising the steps of: - establishing a motion video content pattern parameter; - the pattern of motion parameters of the plurality of predetermined patterns comparing; - determining a repetitive pattern image by using the result of the previous step; wherein the method comprises the following steps: identifying a plurality of different successive field object, an object can be defined as a single motion a portion of the field of the successive image pattern description; • implementation of the following steps: - establishing for each successive objects within the field of motion parameters of the pattern; - the pattern of motion parameters with a plurality of predetermined pattern comparing motion parameters; - determining a repetitive pattern of each object image by using the result of the previous step.

2.用于检测包括一系列接连的场的影片内容的图象重复模式的设备，该设备包括处理装置和存储器(M)，处理装置被安排成可以执行以下步骤：-建立影片内容的运动参量图案；-把图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定图象重复模式；其特征在于，处理装置被安排成可执行以下步骤：·识别在接连的场内的多个不同的对象，一个对象被定义为可以用单个运动模式描述的接连的场的一个图象部分；·执行以下步骤：-建立接连的场内的每个对象的的运动参量图案；-把运动参量图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定每个图案的图象重复模式。 Parametric motion video content to establish: - 2. The device for detecting the image comprising a repeating pattern of video content a series of consecutive fields, the apparatus comprising processing means and a memory (M), the processing means is arranged to perform the following steps pattern; - the pattern stored in the memory a plurality of predetermined pattern comparing motion parameters; - determining a repetitive pattern image by using the result of the previous step; wherein the processing means is arranged to perform the following steps: · identifying a plurality of different successive field object, an object is defined as a portion of the image field can be successively with a single motion pattern described; * performing the following steps: - establishing for each successive objects within the field of the motion pattern parameter; - the pattern of motion parameters stored in the memory a plurality of predetermined pattern comparing motion parameters; - by using the result of the previous step to determine the image of each pattern repeat mode.

3.按照权利求2的设备，其中所述处理装置被安排成可通过也使用运动估值技术来识别所述多个不同的对象。 3. The apparatus according to claim 2 request, wherein the processing means is arranged to be used by the motion estimation technique to identify a plurality of different objects.

4.按照权利求3的设备，包括多个运动模型参量估值器(PEm(n))，它们并行地运行，以执行所述运动估值技术。 4. seeking apparatus according to claim 3, comprising a plurality of parameters of the motion model estimator (PEm (n)), which run in parallel, to perform the motion estimation technique.

5.按照权利求2的设备，包括分段单元(SU)，以便执行递归的分段方法来识别所述多个对象。 5. The apparatus according to claim 2, seeking, including segmentation unit (the SU), in order to perform a recursive segmentation method of identifying the plurality of objects.

7.按照权利求2的设备，其中所述预定的运动参量图案涉及到以下的影片模式组中的至少一个影片模式：2-2下拉模式、3-2下拉模式、和视频模式。 7. The apparatus according to claim 2 requirements, wherein said predetermined pattern of motion parameters relate to at least one of the following film mode video mode groups: 2-2 pull-down mode, 3-2 pull-down pattern, and video modes.

9.按照权利求8的设备，其中所述影片处理单元被安排成可执行以下步骤中的至少一个步骤：图象速率变换、去交织、和影片震颤去除。 At least one step 9. The apparatus according to claim 8 requirements, wherein said video processing unit is arranged to perform the following steps: converting an image rate, deinterleaving, and removing the film tremor.

10.提供有按照权利求2到9的任一项的设备的芯片。 10. The chip is provided with a device as claimed in any one seek 2 to 9.

11.提供有按照权利求10的芯片的电视设备。 11. The chip 10 is provided with a demand television apparatus according to claim.

12.被计算机设备装载的计算机程序产品，包括用于检测其中包含了一系列接连的场的影片内容的图象重复模式的指令，该设备包括处理装置和存储器，计算机程序产品在被装载后可向处理装置提供执行以下步骤的能力：-建立影片内容的运动参量图案；-把图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定图象重复模式；其特征在于，处理装置被安排成可执行以下步骤：·识别在接连的场内的多个不同的对象，一个对象被定义为可以用多个运动模式描述的接连的场的一个图象部分；·执行以下步骤：-建立接连的场内的每个对象的的运动参量图案；-把运动参量图案与被存储在存储器中的多个预定的运动参量图案进行比较；-通过使用前面步骤的结果来确定每个图案的图象重复模式。 12. A computer apparatus loaded computer program product, comprising means for detecting a series of instructions which contains the image content of the repeating pattern of successive video fields, the apparatus comprising processing means and a memory, the computer program product, after being loaded can It provides the ability to perform the steps of the processing apparatus to: - create a video content movement quantity pattern; - the pattern of the plurality stored in the memory in a predetermined movement parameter pattern comparison; - by using the previous step results to determine the image repeating pattern; wherein the processing means is arranged to perform the following steps: identifying a plurality of different successive objects within the field, an object is defined in a field of FIG successive plurality of motion patterns that can be described by like portion; * performing the following steps: - establishing for each object successive field of a motion parameter pattern; - the movement parameter pattern and the plurality stored in the memory in a predetermined movement parameter pattern comparison; - using the previous the step of determining the result of the image pattern of each repetitive pattern.

13.提供有按照权利求12的计算机程序产品的数据载体。 13. Data carrier provided with a computer program seeking product according to claim 12.

CN 028000582001-01-112002-01-04Recognizing film and video objects occuring in parallel in single television signal fields
CN1456015A
(en)