Thursday, November 20, 2014

Muddiest Point: Week 11

What is the practical advantage of XML over HTML? I feel as though everyone I've talked to has said that they would prefer HTML for coding their own sites.

Wednesday, November 12, 2014

Week 11: Digital Library & Web Search

(1) As technology continues to advance, libraries, educational systems, and scholarly communities are increasingly moving beyond hardbound books and journals toward digital collections. (2) Institutional repositories allow researchers to manage and distribute the digital materials created within a specific institution or by its community members. (3) These repositories help to ensure access to these materials, as well as their preservation and organization. (4) A repository should not be limited to research faculty, but should also include students and the community. (5) With masses of information being poured onto the internet through various sources, such as these institutional repositories, the question of how it all gets indexed comes to the forefront. (6) Search engines such as Google, Yahoo, and Microsoft (GYM) use indexing algorithms to build an index of terms, so that they can rapidly identify the documents that may contain a particular word or phrase a searcher is looking for. (7) I won't even attempt to understand how these algorithms work because I am extremely ignorant when it comes to these things. But I know that search engines operate on query-processing algorithms, which search the indexed online resources for the key terms of a query. (8) Many search engines have various ways of speeding these query searches along. Some of these include skipping, early termination, careful assignment of document numbers, and caching. (9) The quality of the results may vary from user to user. For example, one user may search for "pot roast" looking for the history of pot roasts, get recipes as the top hits, and find them irrelevant, while another user would be looking for exactly those pot roast recipes. (10) It is important to understand that no search query is perfect. The more that users post on the internet and deposit into online repositories, the more information will be available for search engines to scour and, hopefully, the better the results will be.
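
To make the indexing and caching ideas above a little less mysterious, here is a minimal Python sketch of a toy index. It is only an illustration, not how Google, Yahoo, or Microsoft actually do it: the corpus, the names `DOCS`, `build_index`, and `search`, and the "every term must match" rule are all my own assumptions.

```python
from collections import defaultdict
from functools import lru_cache

# Hypothetical toy corpus; the document IDs and text are made up for illustration.
DOCS = {
    1: "pot roast recipe with carrots and onions",
    2: "the history of the pot roast in american kitchens",
    3: "slow cooker beef recipe",
}


def build_index(docs):
    """Map each term to the sorted list of document IDs that contain it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


INDEX = build_index(DOCS)


@lru_cache(maxsize=128)  # caching: a repeated query string is answered from memory
def search(query):
    """Return the IDs of documents that contain every term in the query."""
    terms = query.lower().split()
    if not terms:
        return ()
    # Start with the shortest posting list so the candidate set stays small.
    lists = sorted((INDEX.get(term, []) for term in terms), key=len)
    result = set(lists[0])
    for ids in lists[1:]:
        result &= set(ids)
        if not result:
            # Simple shortcut: nothing can match any more, so stop looking.
            # (Real engines use more elaborate, ranking-aware early termination.)
            break
    return tuple(sorted(result))
```

With this toy data, `search("pot roast")` finds both the recipe page and the history page, which is exactly the relevance problem from sentence (9): the index knows which documents contain the words, not which one a particular user wanted.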

Muddiest Point: Week 10

Are there any elements of the XML lab that we missed from this week that would have been useful for Assignment 5?

Wednesday, November 5, 2014

Week 10: XML

(1) Extensible Markup Language, or XML, defines a set of rules for encoding documents so that they can be read by both humans and machines. (2) An XML document is made up of a series of entities, each containing one or more logical elements, and markup is what marks the start and end of each part of an interchanged document. (3) Document Type Definitions, or DTDs, allow users of XML to check that each component of a document occurs in a valid place within the interchanged data stream, but they are not required for XML. (4) Unlike HTML, XML was not designed to be a standardized coding system with a fixed set of tags. (5) XML is a formal language that can be used to pass information about the component parts of a document on to another computer system. (6) XML is different from other markup languages in that it clearly identifies the boundaries of every part of a document. (7) An XML file typically consists of three different types of markup: an XML processing instruction (the XML declaration), a document type declaration, and a fully-tagged document instance. (8) The processing instruction identifies the version of XML being used, how the document is encoded, and whether it refers to external files. (9) The document type declaration can contain markup declarations either in an internal subset [between square brackets] or in an external subset [referencing a file containing markup declarations]. (10) Finally, a document instance consists of a root element whose name matches the document type name assigned in the document type declaration. (11) If all three components are present, and the document instance conforms to the rules defined in the document type definition, the document is said to be valid.
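
As a small illustration of the three parts described above, here is a Python sketch that parses a tiny XML file with an internal DTD subset. The `memo` document and the `describe` helper are hypothetical names I made up for this example. One caveat: as far as I know, Python's standard library only checks that the XML is well-formed; it does not validate against the DTD, so this sketch only compares the root element name with the name in the document type declaration.

```python
from xml.dom import minidom

# Hypothetical document: the element names and content are invented for illustration.
DOC = b"""<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<!DOCTYPE memo [
  <!ELEMENT memo (to, body)>
  <!ELEMENT to (#PCDATA)>
  <!ELEMENT body (#PCDATA)>
]>
<memo>
  <to>Week 10 readers</to>
  <body>XML marks the boundaries of every part.</body>
</memo>
"""


def describe(xml_bytes):
    """Parse the XML and report how the root element relates to the DOCTYPE name."""
    # parseString raises an error if the document is not well-formed.
    doc = minidom.parseString(xml_bytes)
    root = doc.documentElement
    # Keep only element children, skipping the whitespace text nodes between tags.
    children = [
        node.tagName
        for node in root.childNodes
        if node.nodeType == node.ELEMENT_NODE
    ]
    doctype_name = doc.doctype.name if doc.doctype is not None else None
    return {
        "doctype": doctype_name,
        "root": root.tagName,
        "root_matches_doctype": doctype_name == root.tagName,
        "children": children,
    }
```

Calling `describe(DOC)` reports `memo` for both the document type and the root element. Checking that `to` and `body` really appear in the order the DTD demands would need a validating parser, which is outside this sketch.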

Muddiest Point: Week 9

For the A5 submission, will the URL we enter in the Word document be "http://www.pitt.edu/~PittAccount/"?