Type of Document Master's Thesis Author Sripadham, Shankar B Author's Email Address email@example.com URN etd-08042000-15330050 Title Semantic Decomposition By Covering Degree Master of Science Department Electrical and Computer Engineering Advisory Committee
Advisor Name Title Cyre, Walling R. Committee Chair Broadwater, Robert P. Committee Member VanLandingham, Hugh F. Committee Member Keywords
- branch and bound algorithm
- Linguistic interpretation
- Sentence interpretation
Date of Defense 2000-08-04 Availability unrestricted AbstractSemantic Decomposition by Covering
Shankar B. Sripadham
This thesis describes the implementation of a covering algorithm for semantic decomposition of sentences of technical patents. This research complements the ASPIN project that has a long term goal of providing an automated system for digital system synthesis from patents.
In order to develop a prototype of the system explained in a patent, a natural language processor (sentence-interpreter) is required. These systems typically attempt to interpret a sentence by syntactic analysis (parsing) followed by semantic analysis. Quite often, the technical narrative contains grammatical errors, incomplete sentences, anaphoric references and typological errors that can cause the grammatical parse to fail. In such situations, an alternate method that uses a repository of pre-compiled, simple sentences (called frames) to analyze the sentences of the patent can be a useful back up. By semantically decomposing the sentences of patents to a set of frames whose meanings are fully understood, the meaning of the patent sentences can be interpreted.
This thesis deals with the semantic decomposition of sentences using a branch and bound covering algorithm. The algorithm is implemented in C++. A number of experiments were conducted to evaluate the performance of this algorithm. The covering algorithm uses a standard branch and bound algorithm to semantically decompose sentences. The algorithm is fast, flexible and can provide good (100 % coverage for some sentences) coverage results. The system covered 67.68 % of the sentence tokens using 3459 frames in the repository. 54.25% of the frames identified by the system in covers for sentences, were found to be semantically correct. The experiments suggest that the performance of the system can be improved by increasing the number of frames in the repository.
Filename Size Approximate Download Time (Hours:Minutes:Seconds)
28.8 Modem 56K Modem ISDN (64 Kb) ISDN (128 Kb) Higher-speed Access finalthesis2a.pdf 425.90 Kb 00:01:58 00:01:00 00:00:53 00:00:26 00:00:02
If you have questions or technical problems, please Contact DLA.