Varjo Patent | Systems and methods for visually indicating stale content in environment model

编辑：映维 | 分类：Varjo | 2023年8月3日

Patent: Systems and methods for visually indicating stale content in environment model

Publication Number: 20230245408

Publication Date: 2023-08-03

Assignee: Varjo Technologies Oy

Abstract

A system includes server(s) configured to: receive plurality of images of real-world environment captured by camera(s); process a number of images to detect plurality of objects present in a real-world environment and generate a three-dimensional environment model of the real-world environment; classify each of the objects as either a static or dynamic object; receive current image(s) of the real-world environment; process the current image(s) to detect object(s); determine whether or not the object(s) is/are from amongst the plurality of objects; determine whether the object(s) is a static object or dynamic object when it is determined that the object(s) is/are from amongst the plurality of objects; and for each dynamic object that is represented in the three-dimensional environment model but not in a current image(s), apply a first visual effect to a representation of the dynamic object in the three-dimensional environment model for indicating staleness of the representation.

Claims

1.A system comprising at least one server configured to: receive a plurality of images of a real-world environment captured by at least one camera; process the plurality of images to detect a plurality of objects present in the real-world environment and generate a three-dimensional environment model of the real-world environment, wherein the three-dimensional environment model represents the plurality of objects; classify each of the plurality of objects as either a static object or a dynamic object; receive at least one current image of the real-world environment captured by the at least one camera; process the at least one current image to detect at least one object represented therein; determine whether or not the at least one object is from amongst the plurality of objects; determine whether the at least one object is a static object or a dynamic object when it is determined that the at least one object is from amongst the plurality of objects; and for each dynamic object that is represented in the three-dimensional environment model but not in the at least one current image, apply a first visual effect to a representation of said dynamic object in the three-dimensional environment model for indicating staleness of said representation.

2.The system of claim 1, wherein, when applying the first visual effect to the representation of said dynamic object in the three-dimensional environment model, the at least one server is configured to: determine whether or not said dynamic object lies within a field of view of the at least one camera; and select the first visual effect to be applied as one of: fading, desaturation, when it is determined that said dynamic object does not lie within the field of view of the at least one camera.

3.The system of claim 1, wherein, when applying the first visual effect to the representation of said dynamic object in the three-dimensional environment model, the at least one server is configured to: determine whether or not said dynamic object lies within a field of view of the at least one camera; determine whether said dynamic object is occluded by another dynamic object or by a static object in the at least one current image, when it is determined that said dynamic object lies within the field of view of the at least one camera; select the first visual effect to be applied as one of: fading, desaturation, when it is determined that said dynamic object is occluded by the static object in the at least one current image; and select the first visual effect to be applied as at least one of: fading, deleting said dynamic object, when it is determined that said dynamic object is occluded by another dynamic object in the at least one current image.

4.The system of claim 3, wherein when it is determined that said dynamic object is occluded by another dynamic object in the at least one current image, the first visual effect is applied to the representation of said dynamic object in the three-dimensional environment model in a manner that said dynamic object is faded initially and is deleted from the three-dimensional environment model after a predefined time period from fading.

5.The system of claim 1, wherein the at least one server is further configured to, for each static object that is represented in the three-dimensional environment model but not in the at least one current image, apply a second visual effect to a representation of said static object in the three-dimensional environment model for indicating staleness of said representation.

6.The system of claim 5, wherein the second visual effect is darkening a tint of said static object.

7.The system of claim 1, wherein the at least one server is further configured to, for each object that is represented in the three-dimensional environment model and also in the at least one current image, update a representation of said object in the three-dimensional environment model using a representation of said object in the at least one current image.

8.The system of claim 7, wherein when said object is a dynamic object and the at least one current image comprises a sequence of current images captured by the at least one camera, the at least one server is further configured to: process the sequence of current images to detect a sequence of locations of the dynamic object as represented therein; and update the three-dimensional environment model by deleting at least one previous representation of the dynamic object from at least one previous location from amongst the sequence of locations.

9.The system of claim 1, wherein the at least one server is further configured to, for each region of the real-world environment that is represented in the three-dimensional environment model but not in the at least one current image, apply at least one third visual effect to a representation of said region, the at least one third visual effect being at least one volumetric effect.

10.The system of claim 1, wherein when applying a given visual effect, the at least one server is configured to control at least one of: an intensity of the given visual effect, a colour associated with the given visual effect, a speed of applying the given visual effect, a time duration of applying the given visual effect, removal of the given visual effect.

11.The system of claim 1, wherein the at least one server is further configured to: receive, from at least one client device, information indicative of a given pose of at least one client device; utilise the three-dimensional environment model to generate at least one reconstructed image from a perspective of the given pose of the at least one client device; and send the at least one reconstructed image to the at least one client device for display thereat.

12.A method comprising: receiving a plurality of images of a real-world environment captured by at least one camera; processing the plurality of images to detect a plurality of objects present in the real-world environment and generate a three-dimensional environment model of the real-world environment, wherein the three-dimensional environment model represents the plurality of objects; classifying each of the plurality of objects as either a static object or a dynamic object; receiving at least one current image of the real-world environment captured by the at least one camera; processing the at least one current image to detect at least one object represented therein; determining whether or not the at least one object is from amongst the plurality of objects; determining whether the at least one object is a static object or a dynamic object when it is determined that the at least one object is from amongst the plurality of objects; and for each dynamic object that is represented in the three-dimensional environment model but not in the at least one current image, applying a first visual effect to a representation of said dynamic object in the three-dimensional environment model for indicating staleness of said representation.

13.The method of claim 12, wherein the step of applying the first visual effect to the representation of said dynamic object in the three-dimensional environment model comprises: determining whether or not said dynamic object lies within a field of view of the at least one camera; and selecting the first visual effect to be applied as one of: fading, desaturation, when it is determined that said dynamic object does not lie within the field of view of the at least one camera.

14.The method of claim 12, the step of applying the first visual effect to the representation of said dynamic object in the three-dimensional environment model comprises: determining whether or not said dynamic object lies within a field of view of the at least one camera; determining whether said dynamic object is occluded by another dynamic object or by a static object in the at least one current image, when it is determined that said dynamic object lies within the field of view of the at least one camera; selecting the first visual effect to be applied as one of: fading, desaturation, when it is determined that said dynamic object is occluded by the static object in the at least one current image; and selecting the first visual effect to be applied as at least one of: fading, deleting said dynamic object, when it is determined that said dynamic object is occluded by another dynamic object in the at least one current image.

15.The method of claim 14, wherein when it is determined that said dynamic object is occluded by another dynamic object in the at least one current image, the first visual effect is applied to the representation of said dynamic object in the three-dimensional environment model in a manner that said dynamic object is faded initially and is deleted from the three-dimensional environment model after a predefined time period from fading.

16.The method of claim 12, further comprising, for each static object that is represented in the three-dimensional environment model but not in the at least one current image, applying a second visual effect to a representation of said static object in the three-dimensional environment model for indicating staleness of said representation.

17.The method of claim 16, wherein the second visual effect is darkening a tint of said static object.

18.The method of claim 12, further comprising, for each object that is represented in the three-dimensional environment model and also in the at least one current image, updating a representation of said object in the three-dimensional environment model using a representation of said object in the at least one current image.

19.The method of claim 18, wherein when said object is a dynamic object and the at least one current image comprises a sequence of current images captured by the at least one camera, the method further comprises: processing the sequence of current images to detect a sequence of locations of the dynamic object as represented therein; and updating the three-dimensional environment model by deleting at least one previous representation of the dynamic object from at least one previous location from amongst the sequence of locations.

20.The method of claim 12, further comprising, for each region of the real-world environment that is represented in the three-dimensional environment model but not in the at least one current image, applying at least one third visual effect to a representation of said region, the at least one third visual effect being at least one volumetric effect.

21.The method of claim 12, wherein the step of applying a given visual effect comprises controlling at least one of: an intensity of the given visual effect, a colour associated with the given visual effect, a speed of applying the given visual effect, a time duration of applying the given visual effect, removal of the given visual effect.

22.The method of claim 12, further comprising: receiving, from at least one client device, information indicative of a given pose of at least one client device; utilising the three-dimensional environment model to generate at least one reconstructed image from a perspective of the given pose of the at least one client device; and sending the at least one reconstructed image to the at least one client device for display thereat.

Description

TECHNICAL FIELD

The present disclosure relates to systems for visually indicating stale content in environment models. The present disclosure also relates to methods for visually indicating stale content in environment models.

BACKGROUND

In recent times, there has been an ever-increasing demand for image generation and processing. For example, such a demand may be quite high and critical in case of evolving technologies such as immersive extended-reality (XR) technologies which are being employed in various fields such as entertainment, real estate, training, medical imaging operations, simulators, navigation, and the like. Several advancements are being made to develop image generation and processing technology. Typically, three-dimensional (3D) models (for example, in form of 3D polygonal mesh, 3D point cloud, 3D grid, and the like) of real-world environments are generated and are subsequently employed as input for generating images to be displayed. The 3D models are typically created using images of the real-world environments that are captured by one or more cameras.

Presently, the 3D models are quite limited in terms of accurately representing an entirety of the real-world environments, as at any given time, the one or more cameras only provide only a partial coverage of the real-world environment in real time and representations in the 3D models of anything that is not presently in view of the one or more cameras are not updated. This leads to stale (i.e., obsolete or old) representations being present in the 3D models. This issue is extremely problematic in case of dynamic objects (i.e., objects which are prone to change in their properties, for example, such as a change in their locations and/or orientations) present in the real-world environments. For example, if a person standing in a room at a first location is imaged in a first image, and if the person later moves to a second location and is imaged in a second image, the 3D model generated using the first and second images would contain two representations of the same person at two different locations. Herein, one of the representations, i.e., the representation generated using the first image, is inaccurate as it is based on a stale information about location of the person. Moreover, when images are reconstructed using such inaccurate 3D models, such images do not accurately represent the objects of the real-world environment and are often unrealistic.

Therefore, in light of the foregoing discussion, there exists a need to overcome the aforementioned drawbacks associated with accuracy of 3D models of real-world environments.

SUMMARY

The present disclosure seeks to provide a system for visually indicating stale content in an environment model. The present disclosure also seeks to provide a method for visually indicating stale content in an environment model. An aim of the present disclosure is to provide a solution that overcomes at least partially the problems encountered in prior art.

In one aspect, an embodiment of the present disclosure provides a system comprising at least one server configured to:

receive a plurality of images of a real-world environment captured by at least one camera;

process the plurality of images to detect a plurality of objects present in the real-world environment and generate a three-dimensional environment model of the real-world environment, wherein the three-dimensional environment model represents the plurality of objects;

classify each of the plurality of objects as either a static object or a dynamic object;

receive at least one current image of the real-world environment captured by the at least one camera;

process the at least one current image to detect at least one object represented therein;

determine whether or not the at least one object is from amongst the plurality of objects;

determine whether the at least one object is a static object or a dynamic object when it is determined that the at least one object is from amongst the plurality of objects; and

for each dynamic object that is represented in the three-dimensional environment model but not in the at least one current image, apply a first visual effect to a representation of said dynamic object in the three-dimensional environment model for indicating staleness of said representation.

In another aspect, an embodiment of the present disclosure provides a method comprising:

receiving a plurality of images of a real-world environment captured by at least one camera;

processing the plurality of images to detect a plurality of objects present in the real-world environment and generate a three-dimensional environment model of the real-world environment, wherein the three-dimensional environment model represents the plurality of objects;

classifying each of the plurality of objects as either a static object or a dynamic object;

receiving at least one current image of the real-world environment captured by the at least one camera;

processing the at least one current image to detect at least one object represented therein;

determining whether or not the at least one object is from amongst the plurality of objects;

determining whether the at least one object is a static object or a dynamic object when it is determined that the at least one object is from amongst the plurality of objects; and

for each dynamic object that is represented in the three-dimensional environment model but not in the at least one current image, applying a first visual effect to a representation of said dynamic object in the three-dimensional environment model for indicating staleness of said representation.

Embodiments of the present disclosure substantially eliminate or at least partially address the aforementioned problems in the prior art, and enable accurate and real-time visual indication of stale content in a three-dimensional environment model of a real-world environment.

Additional aspects, advantages, features and objects of the present disclosure would be made apparent from the drawings and the detailed description of the illustrative embodiments construed in conjunction with the appended claims that follow.

It will be appreciated that features of the present disclosure are susceptible to being combined in various combinations without departing from the scope of the present disclosure as defined by the appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The summary above, as well as the following detailed description of illustrative embodiments, is better understood when read in conjunction with the appended drawings. For the purpose of illustrating the present disclosure, exemplary constructions of the disclosure are shown in the drawings. However, the present disclosure is not limited to specific methods and instrumentalities disclosed herein. Moreover, those skilled in the art will understand that the drawings are not to scale. Wherever possible, like elements have been indicated by identical numbers.

Embodiments of the present disclosure will now be described, by way of example only, with reference to the following diagrams wherein:

FIGS. 1A and 1B are block diagrams of architectures of a system for visually indicating stale content in an environment model, in accordance with different embodiments of the present disclosure;

FIGS. 2A and 2B collectively are an illustration of steps of a method for visually indicating stale content in an environment model, in accordance with an embodiment of the present disclosure; and

In the accompanying drawings, an underlined number is employed to represent an item over which the underlined number is positioned or an item to which the underlined number is adjacent. A non-underlined number relates to an item identified by a line linking the non-underlined number to the item. When a number is non-underlined and accompanied by an associated arrow, the non-underlined number is used to identify a general item at which the arrow is pointing.

DETAILED DESCRIPTION OF EMBODIMENTS

The following detailed description illustrates embodiments of the present disclosure and ways in which they can be implemented. Although some modes of carrying out the present disclosure have been disclosed, those skilled in the art would recognize that other embodiments for carrying out or practising the present disclosure are also possible.