วันจันทร์ที่ 17 ตุลาคม พ.ศ. 2554

Chapter 8 : How do search engines work?

 

  • Internet search engines are web search engines that search and retrieve information on the web. Most of them use crawler indexer architecture. They depend on their crawler modules. Crawlers also referred to as spiders are small programs that browse the web.
  • Crawlers are given an initial set of URLs whose pages they retrieve. They extract the URLs that appear on the crawled pages and give this information to the crawler control module. The crawler module decides which pages to visit next and gives their URLs back to the crawlers.
  • The topics covered by different search engines vary according to the algorithms they use. Some search engines are programmed to search sites on a particular topic while the crawlers in others may be visiting as many sites as possible.
  • The crawl control module may use the link graph of a previous crawl or may use usage patterns to help in its crawling strategy.
  • The indexer module extracts the words form each page it visits and records its URLs. It results into a large lookup table that gives a list of URLs pointing to pages where each word occurs. The table lists those pages, which were covered in the crawling process.
  • A collection analysis module is another important part of the search engine architecture. It creates a utility index. A utility index may provide access to pages of a given length or pages containing a certain number of pictures on them.
  • During the process of crawling and indexing, a search engine stores the pages it retrieves. They are temporarily stored in a page repository. Search engines maintain a cache of pages they visit so that retrieval of already visited pages expedites.
  • The query module of a search engine receives search requests form users in the form of keywords. The ranking module sorts the results.
  • The crawler indexer architecture has many variants. It is modified in the distributed architecture of a search engine. These search engine architectures consist of gatherers and brokers. Gatherers collect indexing information from web servers while the brokers give the indexing mechanism and the query interface. Brokers update indices on the basis of information received from gatherers and other brokers. They can filter information. Many search engines of today use this type of architecture.

    Example of 5 engines  on the Internet





    http://ask.com/
     


      วันอาทิตย์ที่ 16 ตุลาคม พ.ศ. 2554

      Ideas to help and support the flooding crisis in Thailand



      - Every community affected is already in desperate need of water, rice, basic supplies, even clothes.

      - After  flooding crisis. Project concept and idea to help flood victim after flood. looking for ideas to help flooding crisis  flood victim to earn income after flood eg. Otop clothes bag, handmade books or calendar, key ring.

      วันจันทร์ที่ 12 กันยายน พ.ศ. 2554

      Chapter 3 : Library


       Library


      What is Sripatum University website? Identify URL
       http://web.spu.ac.th/


      Why should students go to library
      Search the information by the internet, Reading the books / newspaper / magazine, Borrow the books and VCDs that you need.

      What is the Library of Congress classification? (L.C)
      is subject-based classifying and arranging items by the subjects matter involved. LC uses Alphabet to represent the subjects Alphabet is A - Z


      What is the Dewey Decimal Classification? (D.D.C.)
      Is to use the number 000 - 900 to represent the subjects.

      What is call number?
       A call number is like an address: it tells us where the book is located in the library.



      What is the Online Public Access Catalog (OPAC)
      Is an online database of materials held by a library or group of libraries. Users search alibrary catalog principally to locate books and other material physically located at a library.


      Credit : http://en.wikipedia.org/wiki/Library_of_Congress
                 http://www.usg.edu/galileo/skills/unit03/libraries03_04.phtml
                 http://www.cerritos.edu/library/guides/research/LC_System.html

      วันอาทิตย์ที่ 11 กันยายน พ.ศ. 2554

      Chapter 2 : Objective facts/Subjective opinion

      1. Objective Facts

      Ex. Japanese Spitz



      The Japanese Spitz (日本スピッツ Nihon Supittsu) is a small to medium breed of dog of the Spitz type. The Japanese Spitz is a companion dog and pet. There are varying standards around the world as to the ideal size of the breed, but they are always larger than their smaller cousins, the Pomeranian. They were developed in Japan in the 1920s and 30s by breeding a number of other Spitz type dog breeds together. They are recognized by the vast majority of the major kennel clubs, except the American Kennel Club due to it being similar appearance to the white Pomeranian dog, American Eskimo Dog and Samoyed Dog. While they are a relatively new breed, they are becoming widely popular due to their favorable temperament and other features.


      The major health concern is patellar luxation, and a minor recurring concern is that the breed can be prone to runny eyes. They can act as reliable watchdogs, but are a type of companion dog and prefer to be an active part of the family. Although they might appear fluffy, they are a low maintenance breed as the coat has a non stick texture often compared to teflon.

      2. Subjective Opinion

      Ex. Japanese Spitz


      The Japanese Spitz is a small dog, around 33 cm  at the withers, with a somewhat square body, deep chest, and a very thick, pure white double coat. The coat consists of an outer coat that stands off from the soft inner coat, with fur shorter on the muzzle and ears as well as the fronts of the forelegs and the hindlegs. A ruff of longer fur is around the dog's neck. It has a pointed muzzle and small, triangular shape prick ears (ears that stand up.) The tail is long, heavily covered with long fur, and is carried curled over and lying on the dog's back. The white coat contrasts with the black pads and nails of the feet, the black nose, and the dark eyes. The large oval (akin to a ginko seed) eyes are dark and slightly slanted with white eyelashes, and the nose and lips and eye rims are black. The face of the Japanese Spitz is wedge-shaped.

      Credit http://en.wikipedia.org/wiki/Japanese_Spitz

      Chapter 1 Data, Information, Knowledge, and Wisdom

      Data, Information, Knowledge, and Wisdom

      by Gene Bellinger, Durval Castro, Anthony Mills
      There is probably no segment of activity in the world attracting as much attention at present as that of knowledge management. Yet as I entered this arena of activity I quickly found there didn't seem to be a wealth of sources that seemed to make sense in terms of defining what knowledge actually was, and how was it differentiated from data, information, and wisdom. What follows is the current level of understanding I have been able to piece together regarding data, information, knowledge, and wisdom. I figured to understand one of them I had to understand all of them.
      According to Russell Ackoff, a systems theorist and professor of organizational change, the content of the human mind can be classified into five categories:
      1. Data: symbols

      2. Information: data that are processed to be useful; provides answers to "who", "what", "where", and "when" questions

      3. Knowledge: application of data and information; answers "how" questions

      4. Understanding: appreciation of "why"

      5. Wisdom: evaluated understanding.
      Ackoff indicates that the first four categories relate to the past; they deal with what has been or what is known. Only the fifth category, wisdom, deals with the future because it incorporates vision and design. With wisdom, people can create the future rather than just grasp the present and past. But achieving wisdom isn't easy; people must move successively through the other categories.
      A further elaboration of Ackoff's definitions follows:
      Data... data is raw. It simply exists and has no significance beyond its existence (in and of itself). It can exist in any form, usable or not. It does not have meaning of itself. In computer parlance, a spreadsheet generally starts out by holding data.
      Information... information is data that has been given meaning by way of relational connection. This "meaning" can be useful, but does not have to be. In computer parlance, a relational database makes information from the data stored within it.
      Knowledge... knowledge is the appropriate collection of information, such that it's intent is to be useful. Knowledge is a deterministic process. When someone "memorizes" information (as less-aspiring test-bound students often do), then they have amassed knowledge. This knowledge has useful meaning to them, but it does not provide for, in and of itself, an integration such as would infer further knowledge. For example, elementary school children memorize, or amass knowledge of, the "times table". They can tell you that "2 x 2 = 4" because they have amassed that knowledge (it being included in the times table). But when asked what is "1267 x 300", they can not respond correctly because that entry is not in their times table. To correctly answer such a question requires a true cognitive and analytical ability that is only encompassed in the next level... understanding. In computer parlance, most of the applications we use (modeling, simulation, etc.) exercise some type of stored knowledge.
      Understanding... understanding is an interpolative and probabilistic process. It is cognitive and analytical. It is the process by which I can take knowledge and synthesize new knowledge from the previously held knowledge. The difference between understanding and knowledge is the difference between "learning" and "memorizing". People who have understanding can undertake useful actions because they can synthesize new knowledge, or in some cases, at least new information, from what is previously known (and understood). That is, understanding can build upon currently held information, knowledge and understanding itself. In computer parlance, AI systems possess understanding in the sense that they are able to synthesize new knowledge from previously stored information and knowledge.
      Wisdom... wisdom is an extrapolative and non-deterministic, non-probabilistic process. It calls upon all the previous levels of consciousness, and specifically upon special types of human programming (moral, ethical codes, etc.). It beckons to give us understanding about which there has previously been no understanding, and in doing so, goes far beyond understanding itself. It is the essence of philosophical probing. Unlike the previous four levels, it asks questions to which there is no (easily-achievable) answer, and in some cases, to which there can be no humanly-known answer period. Wisdom is therefore, the process by which we also discern, or judge, between right and wrong, good and bad. I personally believe that computers do not have, and will never have the ability to posses wisdom. Wisdom is a uniquely human state, or as I see it, wisdom requires one to have a soul, for it resides as much in the heart as in the mind. And a soul is something machines will never possess (or perhaps I should reword that to say, a soul is something that, in general, will never possess a machine).
      Personally I contend that the sequence is a bit less involved than described by Ackoff. The following diagram represents the transitions from data, to information, to knowledge, and finally to wisdom, and it is understanding that support the transition from each stage to the next. Understanding is not a separate level of its own.
      Data represents a fact or statement of event without relation to other things.
      Ex: It is raining.
      Information embodies the understanding of a relationship of some sort, possibly cause and effect.
      Ex: The temperature dropped 15 degrees and then it started raining.
      Knowledge represents a pattern that connects and generally provides a high level of predictability as to what is described or what will happen next.
      Ex: If the humidity is very high and the temperature drops substantially the atmospheres is often unlikely to be able to hold the moisture so it rains.
      Wisdom embodies more of an understanding of fundamental principles embodied within the knowledge that are essentially the basis for the knowledge being what it is. Wisdom is essentially systemic.
      Ex: It rains because it rains. And this encompasses an understanding of all the interactions that happen between raining, evaporation, air currents, temperature gradients, changes, and raining.
      Yet, there is still a question regarding when is a pattern knowledge and when is it noise. Consider the following:
      • Abugt dbesbt regtc uatn s uitrzt.
      • ubtxte pstye ysote anet sser extess
      • ibxtedstes bet3 ibtes otesb tapbesct ehracts
      It is quite likely this sequence represents 100% novelty, which means it's equivalent to noise. There is no foundation for you to connect with the pattern, yet to me the statements are quite meaningful as I understand the translation with reveals they are in fact Newton's 3 laws of motion. Is something knowledge if you can't understand it?
      Now consider the following:
      • I have a box.
      • The box is 3' wide, 3' deep, and 6' high.
      • The box is very heavy.
      • The box has a door on the front of it.
      • When I open the box it has food in it.
      • It is colder inside the box than it is outside.
      • You usually find the box in the kitchen.
      • There is a smaller compartment inside the box with ice in it.
      • When you open the door the light comes on.
      • When you move this box you usually find lots of dirt underneath it.
      • Junk has a real habit of collecting on top of this box.
      What is it?
      A refrigerator. You knew that, right? At some point in the sequence you connected with the pattern and understood it was a description of a refrigerator. From that point on each statement only added confirmation to your understanding.
      If you lived in a society that had never seen a refrigerator you might still be scratching your head as to what the sequence of statements referred to.
      Also, realize that I could have provided you with the above statements in any order and still at some point the pattern would have connected. When the pattern connected the sequence of statements represented knowledge to you. To me all the statements convey nothing as they are simply 100% confirmation of what I already knew as I knew what I was describing even before I started.

      References:

      Credit http://www.systems-thinking.org/dikw/dikw.htm