Abstract: Improved information processing techniques for measuring similarity between instances in an ontology are disclosed. For example, a method of measuring similarity between instances in an ontology for use in an information retrieval system includes the following steps. A set of instances from the ontology is obtained. At least one of the following similarity metrics for the set of instances is computed: (i) a first metric that measures similarity between instances in the set of instances with respect to ontology concepts to which the instances belong; (ii) a second metric which measures similarity between instances in the set of instances where the instances are subjects in statements involving a given ontology property; and (iii) a third metric which measures similarity between instances in the set of instances where the instances are objects in statements involving a given ontology property.
Abstract: Embodiments of systems and methods for comparing attributes of a data record are presented herein. In some embodiments, a weight is based on a comparison of the name (or other) attributes of data records. In some embodiments, an information score may be calculated for each of two name attributes to be compared to get an average information score for the two name attributes. The two name attributes may then be compared against one another to generate a weight between the two attributes. This weight can then be normalized to generate a final weight between the two business name attributes. Comparing attributes according to embodiments disclosed herein can facilitate linking data records even if they comprise attributes in languages which do not use the Latin alphabet.
Abstract: An exemplary embodiment of the invention relates to a method and storage medium for providing web-based electronic research and presentation. The method includes scanning an active document on a computer to identify relevant keywords, searching a database for reference materials relating to the relevant keywords, and displaying relevant reference materials on the computer. The method also includes integrating process software for providing the web-based electronic research and presentation functions via a document creation application. The integration includes determining if the process software will execute on a server, identifying an address of the server, checking the server for operating systems, applications, and version numbers for validation with the process software, and identifying any missing applications required for integration.
Type:
Application
Filed:
April 18, 2008
Publication date:
August 14, 2008
Applicant:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Edward E. Kelley, Tijs Y. Wilbrink, Ellis Zijlstra
Abstract: Embodiments for hyperlink content abstraction are disclosed. In one embodiment a method of hyperlink content abstraction includes selecting a hyperlink with a user directed pointing device, implementing computing device executable instructions to access an electronic document linked to the hyperlink, analyzing a number of content items within the electronic document to determine the content of the document, and compiling a summary of the content of the electronic document based upon the analysis.
Abstract: In a search system for searching a database based on a keyword input from a portable terminal while outputting a result of the search to the portable terminal to display, there are provided a search server that searches the database for information including the input keyword, a degree-of-popularity calculating apparatus which acquires number-of-member information from an i-ModeĀ® server having the number-of-member information on the number of members for each menu in which various webpages are sorted for each category thereof, and calculates the degree of popularity of the menu based on the acquired number-of-member information, and information ranking means for ranking a plurality of pieces of information searched by the search server based on the degree of popularity of the menu to which the information belongs generated by the degree-of-popularity calculating apparatus.
Abstract: A system and methods for comparing differences and similarities of at least two models including generating corresponding metamodel maps, visual representation of the models, and conducting a series of phases of comparison of the models using a mapping index, wherein the mapping index includes the metamodel maps and the visual representation of the models to produce a comparison output.
Abstract: Disclosed are apparatus and methods for quantifying the value of purchasing a particular search keyword, so that a particular search result is presented in a sponsored search results list for that particular search term, as compared to not purchasing the particular search keyword. In example embodiments, the quantified value of the particular search term indicates the particular search term's incremental value when the particular search result is presented in the sponsored search result list, as compared to when the same particular search result is not presented in such sponsored search result list. The particular search term's incremental value is based on a difference between the sum of the number of searchers who select the particular search result from the sponsored search list and the algorithmic search list, if any, versus the number of searchers who select the particular search result when it is not presented in the sponsored search list and may be only presented in the algorithmic search results.
Abstract: Methods and apparatus, including computer systems and program products, for executing a query on a subset of data, for example, to facilitate a fast search with a very large result set. In one general aspect, a method of executing a query includes receiving a query for execution on data in the data repository; generating an estimate of a number of results of the query; defining a subset of data in the data repository; determining whether to execute the query on the subset of the data; executing the query on the subset of the data to generate a partial set of results if the query is to be executed on the subset of the data, otherwise executing the query on the data repository to generate a complete set of results; and providing query results.
Abstract: A method and system for searching a broad set of electronically based unrelated documents in a manner that identifies the interlinking characteristics between the documents returned via several iterative levels of search results is provided. The interlinking characteristics are then analyzed using a betweenness centrality algorithm to calculate the relative strength of the interlinking relationships in order to identify and create the shortest search paths that lead a user to results having the highest betweeness centrality or having the highest relevance to the stated query.
Abstract: Techniques are described for ranking the relevance of electronic documents, such as web pages. An algorithm extracts keywords and recurring phrases from the anchor tag data in electronic documents to define a set of concepts. The algorithm then uses link, concept pairs to create nodes in a graph. In this graph, edges can represent both explicit and implicit conceptual links between nodes. By including conceptual data, the algorithm may model and utilize inter-concept relationships when using graph ranking algorithms. This may improve result accuracy by not only retrieving links which are more authoritative given a users' context, but also by utilizing a larger pool of web pages that are limited by concept-space, rather than keyword-space.
Type:
Application
Filed:
June 27, 2007
Publication date:
February 7, 2008
Applicant:
Regents of the University of Minnesota
Inventors:
Colin DeLong, Sandeep Mane, Jaideep Srivastava