<?xml version="1.0" encoding="utf-8"?>
<!-- generator="Joomla! - Open Source Content Management" -->
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>escience.ime.usp.br/vision</title>
		<description><![CDATA[vision]]></description>
		<link>http://escience.ime.usp.br/vision/</link>
		<lastBuildDate>Wed, 21 Aug 2019 14:55:52 +0000</lastBuildDate>
		<generator>Joomla! - Open Source Content Management</generator>
		<atom:link rel="self" type="application/rss+xml" href="http://escience.ime.usp.br/vision/rss"/>
		<language>en-gb</language>
		<item>
			<title>Exploring Structured Data in Textual Content from the Web: Methods, Techniques and Applications (June 04, 2013)</title>
			<link>http://escience.ime.usp.br/vision/seminary-altigran</link>
			<guid isPermaLink="true">http://escience.ime.usp.br/vision/seminary-altigran</guid>
			<description><![CDATA[<div class="feed-description"><div style="width: 100%; padding-bottom: 10px; padding-right: 0px;">
<div style="float: left; width: 65%; box-sizing: border-box; -moz-box-sizing: border-box; padding-right: 25px;">
<h4>ABSTRACT</h4>
<p style="text-align: justify;"><span style="text-align: justify; font-family: Tahoma, Helvetica, Arial, sans-serif; line-height: 1.3em;">Although search engines are currently the most effective and popular tools for Information Retrieval on the Web, there is now a consensus that their potential can be exploited more effectively. This is particularly true in the current scenario of expanding social networks, the consolidation of Web 2.0, and the emergence of the so-called Web of Data. This realization has led to multiple proposals for increasing the expressive power of queries over Web content, both syntactically, for example by adopting XML technology, and semantically, for example by adopting the resources collectively known as the Semantic Web.</span></p>
<p style="text-align: justify;"><span style="line-height: 1.3em;"><span>Although very promising, some of these proposals have run into difficulties with the adoption of standards, a problem inherent to the nature of the Web. In this talk we focus on another perspective for addressing this issue: the development of methods and techniques for automatically gathering, extracting and exploiting (semi-)structured data that are implicitly available in the vast unstructured textual content on the Web. Work that seeks to exploit these data effectively has appeared in the literature for over a decade. However, a series of recent advances in Information Retrieval, Machine Learning and Data Mining has given this issue new impetus in the scientific community. This is evidenced by the considerable space that venues in important areas such as Databases, Information Retrieval and Artificial Intelligence have devoted to related research. Such interest is justified not only by the challenging problems that arise, but mainly by the growing demand from industry to solve them. This makes the results of research on this subject not only immediately applicable, but also a source of continuous feedback for the scientific investigation around it. The theme involves several classes of problems, some of which will be addressed here, namely: Data Extraction from Textual Sources, Focused Crawling of Web Pages, Integration of Data Available in Textual Web Sources, and Web Search Considering Structural Features.</span></span></p>
</div>
<div style="float: left; background-color: #f3f3f3; color: #353535; width: 35%; box-sizing: border-box; -moz-box-sizing: border-box; border-radius: 10px; -webkit-border-radius: 10px; padding: 10px; border-color: #ccc; border-style: solid; border-width: 1px;" dir="ltr">
<h4><img src="http://escience.ime.usp.br/vision/images/icons/more.png" border="0" width="24" height="24" style="margin-right: 15px; vertical-align: middle;" />EVENT DETAILS</h4>
<hr /><!-- type--> <!--<div style="width: 15%; float: left;"><img src="http://escience.ime.usp.br/vision/images/icons/diploma-icon.png" border="0" width="24" height="24" /></div>
<div style="width: 85%; float: left;"><strong>Category</strong><br />Seminars</div>
<div> </div>--> <!-- Keynote speaker -->
<div style="width: 15%; float: left;"><img src="http://escience.ime.usp.br/vision/images/icons/professor.png" border="0" width="24" height="24" /></div>
<div style="width: 85%; float: left;"><strong>Keynote Speaker</strong><br />Prof. Dr. Da Silva, Altigran Soares - Federal University of Amazonas (IComp / UFAM)</div>
<div> </div>
<!-- Slides -->
<div style="width: 15%; float: left;"><img src="http://escience.ime.usp.br/vision/images/icons/fdownloads-icon.png" border="0" width="24" height="24" /></div>
<div style="width: 85%; float: left;"><strong>Seminar files</strong><br /><a href="http://www.ime.usp.br/~jef/Palestra-Altigran-04062013.pdf" title="Altigran Talk">Download Slides</a> </div>
<div> </div>
<!-- date -->
<div style="width: 15%; float: left;"><img src="http://escience.ime.usp.br/vision/images/icons/date.png" border="0" width="24" height="24" /></div>
<div style="width: 85%; float: left;"><strong>Date</strong><br />June 04, 2013</div>
<div> </div>
<!-- Hour -->
<div style="width: 15%; float: left;"><img src="http://escience.ime.usp.br/vision/images/icons/clock-r.png" border="0" width="24" height="24" /></div>
<div style="width: 85%; float: left;"><strong>Time</strong><br />13:00 to 14:00</div>
<div> </div>
<!-- place -->
<div style="width: 15%; float: left;"><img src="http://escience.ime.usp.br/vision/images/icons/place.png" border="0" width="24" height="24" /></div>
<div style="width: 85%; float: left;"><strong>Place</strong><br />_________ <br /> Cidade Universitária <br /> IME - USP <br /> São Paulo</div>
<div> </div>
</div>
</div>
<!-- fin --><hr /></div>]]></description>
			<author>maratausinchi@gmail.com (Mariela Atausinchi)</author>
			<category>Featured</category>
			<category>Seminars</category>
			<pubDate>Mon, 06 May 2013 16:44:06 +0000</pubDate>
		</item>
	</channel>
</rss>
