<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Networked Video in 10 Years : Networked Video == Parseable Video</title>
	<atom:link href="http://www.walking-productions.com/notslop/2007/02/26/networked-video-in-10-years-networked-video-parseable-video/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.walking-productions.com/notslop/2007/02/26/networked-video-in-10-years-networked-video-parseable-video/</link>
	<description>Not - Shawn's Little Outpost</description>
	<lastBuildDate>Sun, 10 Jan 2010 15:22:14 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: vanevery</title>
		<link>http://www.walking-productions.com/notslop/2007/02/26/networked-video-in-10-years-networked-video-parseable-video/comment-page-1/#comment-783</link>
		<dc:creator>vanevery</dc:creator>
		<pubDate>Tue, 27 Feb 2007 16:13:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.walking-productions.com/notslop/2007/02/26/networked-video-in-10-years-networked-video-parseable-video/#comment-783</guid>
		<description>Hi Dan,

I agree, there &lt;i&gt;could&lt;/i&gt; be software developed that implements search on video for things like faces and ducks and so on.  Unfortunately, every attempt at this that I have seen does a pretty poor job as compared with humans doing the identification.

My feelings are that the whole process of producing video needs to change.  Instead of creating moving images with a dumb camera that just captures light, moving images should be captured with cameras that know the time and location, that can be told the context and that allow video to be marked up and tagged on the spot.  Futhermore, the video shouldn&#039;t be one continous run, it should have scene detection, it should record the settings used in software like exposure, white balance, iris, zoom level and so on.

This would give us a running start and isn&#039;t nessecarily difficult.</description>
		<content:encoded><![CDATA[<p>Hi Dan,</p>
<p>I agree, there <i>could</i> be software developed that implements search on video for things like faces and ducks and so on.  Unfortunately, every attempt at this that I have seen does a pretty poor job as compared with humans doing the identification.</p>
<p>My feelings are that the whole process of producing video needs to change.  Instead of creating moving images with a dumb camera that just captures light, moving images should be captured with cameras that know the time and location, that can be told the context and that allow video to be marked up and tagged on the spot.  Futhermore, the video shouldn&#8217;t be one continous run, it should have scene detection, it should record the settings used in software like exposure, white balance, iris, zoom level and so on.</p>
<p>This would give us a running start and isn&#8217;t nessecarily difficult.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan M</title>
		<link>http://www.walking-productions.com/notslop/2007/02/26/networked-video-in-10-years-networked-video-parseable-video/comment-page-1/#comment-782</link>
		<dc:creator>Dan M</dc:creator>
		<pubDate>Tue, 27 Feb 2007 15:41:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.walking-productions.com/notslop/2007/02/26/networked-video-in-10-years-networked-video-parseable-video/#comment-782</guid>
		<description>Certainly searchability, which requires &#039;parseability&#039;, and without relying on related/linked  text is key to making a really useful video &#039;encyclopedia&#039; out of the great pile of digital images gathering online.  Would not something along the lines of &#039;facial recognition software&#039; be the direction any such tool would have to take? Obviously the problem is considerably more complex than Western text which can be broken down and completely represented in less than 128 pieces. However I suspect that a study of shapes and their relationships could come up with a reasonable number of checks that would lead to near perfect matches. In other words, if you see a duck as an &#039;oval with a triangle attached at one end and a curved tube at the other&#039; with a size relationship thrown in you would in fact find way more actual images of ducks than anything else. Allowance would have to be made for ranking &#039;nearness of match&#039; since while a &#039;t&#039; is a &#039;t&#039; and &#039;oval&#039; can be many things, but that should be easily doable.

A vocabulary of image templates could be built, for example; human faces have certain features within a very limited range of relationships, so do cats, dogs, sailboats, chairs houses, etc., etc., etc.

If this scanning and indexing of images was done on an ongoing basis as the search engines do with text, it would seem to me that a routine search for images of this or that could be quite rapid, and at least as accurate as text based searches on a particular subject, without any reliance on linked text clues at all.</description>
		<content:encoded><![CDATA[<p>Certainly searchability, which requires &#8216;parseability&#8217;, and without relying on related/linked  text is key to making a really useful video &#8216;encyclopedia&#8217; out of the great pile of digital images gathering online.  Would not something along the lines of &#8216;facial recognition software&#8217; be the direction any such tool would have to take? Obviously the problem is considerably more complex than Western text which can be broken down and completely represented in less than 128 pieces. However I suspect that a study of shapes and their relationships could come up with a reasonable number of checks that would lead to near perfect matches. In other words, if you see a duck as an &#8216;oval with a triangle attached at one end and a curved tube at the other&#8217; with a size relationship thrown in you would in fact find way more actual images of ducks than anything else. Allowance would have to be made for ranking &#8216;nearness of match&#8217; since while a &#8216;t&#8217; is a &#8216;t&#8217; and &#8216;oval&#8217; can be many things, but that should be easily doable.</p>
<p>A vocabulary of image templates could be built, for example; human faces have certain features within a very limited range of relationships, so do cats, dogs, sailboats, chairs houses, etc., etc., etc.</p>
<p>If this scanning and indexing of images was done on an ongoing basis as the search engines do with text, it would seem to me that a routine search for images of this or that could be quite rapid, and at least as accurate as text based searches on a particular subject, without any reliance on linked text clues at all.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
