Search as Conversation:

farmpaintlickInternet και Εφαρμογές Web

21 Οκτ 2013 (πριν από 3 χρόνια και 9 μήνες)

91 εμφανίσεις

Search as Conversation:

Search and the ‘Net @ 2011

Michael Hunter

Reference Librarian

Hobart and William Smith Colleges


For Rochester Regional Library Council

Member Libraries’ Staff

Sponsored by the

Rochester Regional Library Council

Supported by Regional Bibliographic Databases and Resources Sharing (RBDB) funds
granted by the

New York State Library 2010



For Today…


The Searchscape @ 2011


Major trends and related services


Other new services


The year @ Google


The year @ other established services


Future directions



Linklist:

http://people.hws.edu/hunter/

searchnet11.htm


The Searchscape @ 2011

Center for the Digital Future


2010 Study

USC Annenberg School for Communication
http://www.digitalcenter.org
Issues of Trust


61%
-

Half or less of online information
is reliable


14%
-

Small portion or none of online
information is reliable (new high)


36%
-

About half of information
from
search engines

is reliable


12%
-

Small portion of none of the
information
from search engines

is
reliable


Center for the Digital Future


2010 Study

2000
-
2009 US Comparisons

Internet Use in the
United States

Hours per week
online

Center for the Digital Future


2010 Study

2000
-
2009 US Comparisons

Broadband access
at home

Most of all online
information is
reliable

Center for the Digital Future


2010 Study

2000
-
2009 US Comparisons

Internet users who
buy online

Communications
technology makes
the world a better
place

What Americans Do Online
nielsenwire Aug. 2, 2010

What Americans Do Online
nielsenwire Aug. 2, 2010

Social networking sites used by students

Students and Information Technology, 2010

ECAR Research Study 6, 2010

How social networking sites are used

Students and Information Technology, 2010

ECAR Research Study 6, 2010

Web Search @ 2011

Who’s crawling the Web?


Google


Yahoo (?)


Bing


Gigablast


Blekko


Exalead


Yandex


Major Trends and Services


Web Curation

Real
-
Time Search

Mobile Search

Other New Services


Web 3.0


Coming of age?


Web 3.0
-

Web
-
based applications or
services that combine metadata, human
interaction and artificial intelligence to
provide a more productive and intuitive
user experience


First used by John Markoff of the New
York Times in 2006


Key Components of Web 3.0

Expanded from Wikipedia “Web 3.0” (Nova Spivack)
http://en.wikipedia.org/wiki/Web_3.0


Semantic Web


Embedded Metadata


Intelligent Applications


Human Judgment and Analysis


Expanded Network Computing


Open Technologies


Open Source


Distributed Databases

Web Curation:

Why we need it and how it can help…


Current problems with whole web
search


Junk


Spam


Term ambiguity (“jaguar”, “saturn”)


User resistance to personalization


Risk of skewing results to favor certain
sites/viewpoints

“Content farms”


Companies that employ freelance writers to
create content specifically designed to rank high
in SE results


Ads are embedded or linked to this content


“Deliver as much as possible as cheaply as
possible”


Demand Media

alone publishes over 1 m articles
a month


Can subvert a SE’s “organic” search algorithms


Blekko’s slashtags help avoid span and content
farm output



Curating the web with Blekko

http://blekko.com


Human/crawler service


Machine “tagging”
-

filetype, date,
language, domain type, etc.


Blekko (human) editors create “topic”
and “built
-
in” tags used to label content
in the Blekko crawler database.


Registered users can create their own
tags for any site in Blekko database


Blekko can be used to create a personal,
searchable web


Blekko: Under the hood


Web search results with relevant
slashtags on the left


Adding a slashtag limits the search to
those sites so tagged


Note: adding multiple Blekko “topic”
slashtags limits the search to sites
which have ALL the tags

Blekko: Under the hood


Slashtag types


Topic


Built
-
in


Content boxes


Third party API’s


User
-
supplied


Results enhancements


Tag the site
(registered users)


SEO


data on inbound
links, crawl stats, etc.


Cache


Links


Info


Tag as spam


More…

Blekko’s weaknesses


Small index


No advanced search features


False hits for certain slashtags


/review

/green

/funny


Possible relevance problems


BUT,
it’s still in BETA

Real
-
time Search


Searching and accessing material as
soon as it is published


News


Blogs


Microblogs


Twitter, Facebook


Social Media sites


Delicious, Digg

Real
-
time meta search

Collecta


http://collecta.com


A real
-
time “newspaper”


Searches blog posts and comments
from Twitter, Jaiku and Identica


Photos from Flickr, TwitPic and yFrog


Videos from YouTube and Ustream


Allows sorting by updates, images,
stories, etc.

Real
-
time meta search

Scoopler

http://scoppler.com


Searches Twitter, Flickr, Digg,
Delicious


Search by subject category


Sort by Web links, images, videos


Real
-
time updating of Tweets
matching your query

Real
-
time at Google


Blended with standard search results


“Latest results for ….”


Available for topics with “buzz”


Shown if real
-
time information is
available from “high quality” sites

Real
-
time at Bing

http://www.bing.com/social


Separate service


Content mainly from Twitter and
Facebook


Two categories


Shared Links


Public Updates (Tweets and posts)


Current updates scroll through when
received

Mobile Search



What people want in mobile
search today…



Voice
-
to
-
text search


Search by images


High personalization in results


Location
-
aware results


Major players


Google


clearly in the lead


Yahoo


strong 2
nd

place


Bing


just entering the race


Device OS often have a default search
service embedded



How students use mobile devices

Students and Information Technology, 2010

ECAR Research Study 6, 2010



Internet use
(excluding text messaging)


Check for information such as


News

Weather

Sports Specific facts


Send and receive e
-
mail


Check social networking sites


Find maps and directions


IM, shopping, download/stream music


Hyperlocal

Search

Outsidein

http://outside.in



“What’s happening right here!”


Searches local news feeds, TV stations
and blogs


Search by city, ZIP, address


Search by keyword
within

any of these
geographical options


RSS feed available for any search

Other New Services

Swingly
: Just the facts

http://swingly.com



Just leveraging social networks can
yield
opinions

not facts


Purely automated service


Utilizes semantic text extraction and
other nlp processing to identify
“facts”


As of 10/15/10


Swingly


100 b Q&A pairs


Ask.com


500 m Q&A pairs

Swingly
: Just the facts

http://swingly.com



Strengths are Who, What, When


Moderately successful with Where


Don’t ask for Why


Additional related information offered


User feedback invited on each search
result


approve, disapprove, comment


Currently in private beta


Login with OpenIDURL:


http://mhunter1.myopenid.com


PW= rrlc123

Surfcanyon

http://surfcanyon.com



Features real
-
time relevance feedback


Resorts Google results based on sites
you visit and how long you are there


Titles and snippets used to predict
relevance in re
-
sorting results


Available as an extension on Firefox,
IE and Chrome for search Google,
Bing, Yahoo and Craigslist

Find the best


Web 3.0 Decision Engine

http://Findthebest.com



Over 277 m Google queries contain
“the best…”


Findthebest answers those types of
queries only


Human
-
created “comparison apps”
compare 2 or more choices side by
side


Content selected and updates by
human editors

Update on the
metas
:

Zuula

http://zuula.com


Searches Google, Bing, Exalead,
Gigablast and Mojeek


“Dynamic pruning” eliminates
duplicates as you scroll through results
from different engines


Search categories include Web, Blogs,
News, Jobs and Tagging Sites
(StumbleUpon, Delicious

Update on the
metas
:

Startpage

http://startpage.com


Formerly Ixquick


Searches Ask, Bing, Yahoo, Blekko (!),
and 7 others


Offers “search by proxy” to avoid
recognition of your IP or cookies


Offers more advanced search features
than most metas

The Year @ Google


Caffeine


Instant


Filter by Location


Public Data
Explorer





Google Editions


Book Agreement


Privacy Issues


Caffeine


New search infrastructure


Changes are “primarily in how we
index”


Enables changes in search algorithms
to be added more easily


Minimal effects on relevance and
ranking


Changes chiefly in ranking of image,
video, news and blog posts.


Instant


Dynamic revision of search
results as you type


Public Data Explorer


http://www.google.com/publicdata/home


World Development Indicators


OECD Factbook


US Census Data


Eurostat Tables


Search by date range


Output options include line, bar or bubble
charts and maps

Google Ebooks store

http://google.com/ebooks


Launched December 6


Ca. ½ m. e
-
books initially


Trade books, professional and
technical titles and (free) out
-
of
-
copyright works


Agreements made with over 35,000
publishers



Google Ebooks store


Readable on smartphones, tablets,
netbooks, desktops and all digital
reading devices except Kindle.


63% of revenue to publishers and
authors


37% of revenue to Google


Google Books


the collection


G. has scanned over 12 million titles (U.S.
libraries estimated to hold 42 million
titles)


2 million public domain titles (pre
-
1923)


2 million scanned with publisher’s permission


8 million remain


“Orphans” (O.P, in copyright)


Copyrights not renewed (pub. 1923
-
1963)


Ca. 50% of 12 million are in foreign languages
(May be excluded from collection)


Google Books


the access


Public domain works


Free to read, download, print


In
-
print, out
-
of
-
copyright (orphans)


To read, download and/or print


$$$ institutional license (colleges,
universities, research centers)


$$$ consumer license (individuals)


Free


public access license through
ONE

terminal in public libraries


read only


($$$ to print or download)


In
-
print copyrighted


depends on
agreement with publisher


The Year at Other
Established Services

Bing

http://www.bing.com


Quick Tabs (results screen)


clustering by subject and context


Real
-
time updates of publications
that appear in the results set, eg.
NYT


Mapping


Geotagged panoramas


Astronomical images (Worldwide
Telescope)


Birds
-
eye image stitching

Yahoo/Bing Alliance
-
Highlights for searchers




Databases separate (for now)


Bing search technology used by both


Exalead

http://www.exalead.com/search


Many advanced search features; some
unique


Video search


Clustering by source and subject
categories (best available at any general
search engine)


Sort by relevance, recency, rating,
length, view count



Yahoo


“Trending Now”


Links to
searches on topics with “buzz”


Ask.com


No longer a crawler
-
based search
engine


Return to original focus
-

Q&A
(the butler returns)


Teoma not updated for now

YouTube topic search

http://youtube.com/topics


Requires user to “opt in”


Following an initial search, suggested
topics appear at top


Run new searches by clicking on the
topics or add them to your current
search (+).


Future Directions


Real
-
time continues to ramp up


Users interact more and more with
search results and processing


Google Instant


updating as users type


Surf Canyon
-

real
-
time updates based on
users link choice and length of visit


Up in the Cloud, more and more….


MS Office “Web Apps”


Google’s Chrome OS





“The future of
search is
verbs”

(Bill Gates, 2008)


Increased natural language
capabilities


Web search is ultimately about actions
-

buying, selling, going, doing


Successful services will increasingly
select information for users based on its
probable
use
; going beyond just a set of
automated search results


Devices will increasingly converge


Why carry 2 devices when you only need
one?


“Five years out, a majority of all
searches will be done on mobile devices,
and a majority of those will be done
through voice”





Barney Pell





Search Strategist, Microsoft


Erosion of Privacy


Sharing personal information via social
networks


Push for “Personalization” in search
results


Increase in hand
-
held and other location
-
based GPS devices


Inclusion of video cameras in more
devices



Thank You and

Happy

Holidays!


Michael Hunter

Reference Librarian

Hobart and William Smith Colleges

Geneva, NY 1445


(315) 781
-
3014


hunter@hws.edu