Web Based Visual Exploration of Patent Information: Difference between revisions
No edit summary |
No edit summary |
||
Line 47: | Line 47: | ||
===Visualization of Classificatory Information=== | ===Visualization of Classificatory Information=== | ||
[[{{ns:6}}:Class.jpg |thumb|right|300px|'''Figure 3''': Treemap of Classificatory Data of Patent Information]] | |||
With a treemap it is possible to show the hierarchical structure of the [http://en.wikipedia.org/wiki/International_Patent_Classification International Patent Classification] IPC. The IPC categories are rendered as nested rectangular areas, which help to keep a clear view of the used categories in large patent sets. To get more information and further details it is possible to zoom into specific parts for an enlargment of the specific labels. An alleviation in these technique is to search for keywords in the titles of IPC categories. The search results are shown as a list and in the treemap as rectangles with red and green filling. The red items are not part of the patent set, the green items are part of the patent set. | With a treemap it is possible to show the hierarchical structure of the [http://en.wikipedia.org/wiki/International_Patent_Classification International Patent Classification] IPC. The IPC categories are rendered as nested rectangular areas, which help to keep a clear view of the used categories in large patent sets. To get more information and further details it is possible to zoom into specific parts for an enlargment of the specific labels. An alleviation in these technique is to search for keywords in the titles of IPC categories. The search results are shown as a list and in the treemap as rectangles with red and green filling. The red items are not part of the patent set, the green items are part of the patent set. | ||
Line 52: | Line 54: | ||
====Relatet Topics==== | ====Relatet Topics==== | ||
[http://www.infovis-wiki.net/index.php?title=Treemap More Info on Treemaps] | * [http://www.infovis-wiki.net/index.php?title=Treemap More Info on Treemaps] | ||
[http://www.infovis-wiki.net/index.php?title=Extreme_visualization:_squeezing_a_billion_records_into_a_million_pixels Squeezing a Billion Records Into a Million Pixels] | * [http://www.infovis-wiki.net/index.php?title=Extreme_visualization:_squeezing_a_billion_records_into_a_million_pixels Squeezing a Billion Records Into a Million Pixels] | ||
[[Category:Techniques]] | [[Category:Techniques]] |
Revision as of 16:39, 27 May 2009
Motivation
On the one hand, patents are a well known instrument to protect intellectual properties. On the other hand, there are several aspects which are often neglected by the scientific community: analyzing patent information can lead to e.g. identification of the direction of technical change, forecasts of market values and getting insights into your competitors work and inventions. At present there are about 60 million patent documents existing worldwide mostly comprising natural language supplemented by figures, diagrams, formulas, etc. As a consequence of this, it is needless to allude that computer support is heavily required. Besides this, patent evaluation still takes a lot of human effort. Therefore, it is very important to explore new approaches in software supported retrieval and visualization of patent information. The main goal in this connection is to improve the users cognition processes; e.g. to faster identify potential search results/constraints etc.
Characteristics of Patent Information
Because of national patent laws, patent documents are following a well-defined structure arranged on a title page:
- bibliographic data and an abstract
- classificatory data
- description of the state of the art
- claims section
- images, diagrammes, formulae, etc. (optional)
As seen above, patent documents comprise some metadata such as bibliographic data and classificatory data. Nevertheless, for evaluation there often is some further information needed: The legal status and patent family information, which both have to be fetched from external data sources like the Europaean Patent Office
Legal Status
The legal status specifies significant events in the lifetime of a patent like changes of the owner etc.
Patent Family
Patents which describe the same invention belong to the same patent family. A common definition says that a patent document belongs to a respective patent family if the claims section and priorities of this document are matching the other patents claims and priorities of this family (e.g. protecting an invention in different countries).
Classificatory Data
The International Patent Classification (IPC) is the number one classification scheme of patent documents for ordering them in terms of technical fields. Therefore the IPC comprises 68.000 (advanced) or 17.000 (core) categories and is hierarchically organized in 13 levels.
Visualization Techniques
Mass-Spring Graphs
During search, it is often a matter of concern to identify clusters or groups e.g. patents that are related to each other by different criteria (same category, same inventor, etc.) and visualize them in an appropriate manner. A common way to do this are (interactive) mass-spring graphs. They are an adequate and intuitive form of visualizing clusters and interrelations between paticular nodes. To simulate the physics of this kind of force-directed graphs, you can use the Barnes-Hut Algorithm that allows efficient n-body force calculations in O(n log n) time.
Figure 1 shows the relationship between patents (yellow) and their corresponding priority document (brown). In the special case of patent documents it is thus possible to i.e. easily identify the priority applications that have been the groundwork for other patents. In gereral, mass-spring graphs are suitable for search through all kinds of hierarchically data to show their interrelationships.
Demonstrations of Related Work and Mass-Spring Graphs
Patent Family Layout
Next to the mass-spring graph, the patent family layout is also part of visually clustering patent sets. It is based on filling date and country in an interactive 2D matrix that are represented as red and blue nodes. The filing date of the patents and priorities are shown on the horizontal position (x-axis). On vertical position, every line describes a country or organization where the patents or priorities have been filed. Arcs will point from a patent to it's priority document and the priority relations are visualized if a node is selected. To get a quick overview of often referenced documents, the node size displays the number of ingoing arcs of the node.
Visualization of Classificatory Information
With a treemap it is possible to show the hierarchical structure of the International Patent Classification IPC. The IPC categories are rendered as nested rectangular areas, which help to keep a clear view of the used categories in large patent sets. To get more information and further details it is possible to zoom into specific parts for an enlargment of the specific labels. An alleviation in these technique is to search for keywords in the titles of IPC categories. The search results are shown as a list and in the treemap as rectangles with red and green filling. The red items are not part of the patent set, the green items are part of the patent set.