QUESTION ANSWERING SYSTEM USING SIMILARITY AND CLASSIFICATION TECHNIQUES

Size: px
Start display at page:

Download "QUESTION ANSWERING SYSTEM USING SIMILARITY AND CLASSIFICATION TECHNIQUES"

Transcription

1 International Journal of Computer Systems (ISSN: ), Volume 03 Issue 07, July, 06 Available at QUESTION ANSWERING SYSTEM USING SIMILARITY AND CLASSIFICATION TECHNIQUES Nabeel Neamah Ȧ, Saidah Saad Ḃ Ȧ Faculty of Information Sciences and Technology, UKM, Bangi, Malaysia Ḃ School of IT, Faculty of Information Sciences and Technology, UKM, Bangi, Malaysia Abstract The main aim of question answering system is provide correct answers based on users queries. Question answering system developed to provide answers for various domains or restricted domain. There are main challenges face the question answering systems such as extract answers based on weak concepts of users queries and difficulty to retrieve accurate answers from large corpus of documents. These challenges increase the difficulty of questions analyzing and retrieve relevant and correct answers based on users queries. This research applies several NLP methods such as tokenization, stemming, and N-gram in order to analyze the users query effectively. Additionally, SVM method is deployed to classify the answers documents based on questions types in order to reduce the searching scope of proposed answers. The findings revealed that the average answers accuracy using CS technique is 67%, the average answers accuracy using LCS technique is 66%, the average answers accuracy using combination of CS and LCS techniques is 70%, and the average answers accuracy using CS, LCS, and SVM is 80%. Results accuracy involving SVM method is more accurate than other methods like CS and LCS. SVM enhance the system accuracy up to % more than using other methods without classification processes Keywords: Question Answering System, NLP, SVM, Hadiths, Classification, Similarity I. INTRODUCTION Nowadays, there are large increasing of information sources such as online sources; these sources contain huge volume of information related to variant fields of topics i.e. economic, health, industry, and educational information [, ]. Traditional retrieving systems like google search engine retrieve information based on searching keywords of the users' queries rather than retrieve exact answers based on searching queries [3, 4]. For example, traditional search engine will retrieve documents that contain words similar to "Iraq" and "capital" based on the query "what is the capital of Iraq" rather than retrieve the accurate answer which is "Baghdad". Therefore, the users could expense efforts and time to find exact answers from large sources. There are two important processes to ensure the accuracy of QAS; () analyze the users' Query needs using various methods such as Natural Processing Language (NLP), and () classify and manage the documents that contain the candidates' answers accurately based on many methods such as machine learning. Therefore, the accurate matching between users' questions and the proposed answers will be founded effectively [5, 6]. The main aim of methods such as NLP is to update the concepts of users' queries based on formal representation of documents concepts which maximize the opportunities of found the similarities between users' quires and documents contents. On the other hand, the questions and documents classifications will support the matching between the users' types of questions and the proposed answers based on questions types [6, 7]. The user s questions classify as many types such as "What" to inquire about facts and explanations, "Where" to ask about places, and "Who" to ask about persons; the documents classified based on the purpose of information depend on questions types i.e. places information to match "Where" questions. The main problem of this research is the difficulty of retrieve accurate answers based on Hadiths documents due to two main reasons which are as the following: i. Difficulty of provide formal concepts of Hadiths query: the Hadiths documents written based on Arabic languages using classical concepts. Currently, Arabic people are used the modern Arabic concepts. This would increase the difficulty of provide query concepts according to formal concepts of Hadiths. Also, non Arabic people face difficulty of provide the right concepts based on English language due to their weakness of Arabic language skills and knowledge weakness of Hadiths formal concepts using English. ii. Large document of Hadiths that provided by various resources: Hadiths are spoken by Mohammad (Islam messenger) and after many decades these Hadiths were written as texts. Currently, there are large numbers of Hadiths published through various sources such as internet and books. This could increase the difficulty of extract right Hadiths that match with users needs. The main objective of this research is to develop questionanswering systems using NLP and machine learning methods in order to retrieve accurate answers of Hadiths based on users' questions. The following section presents related works to this research. Section 3 explains the 54

2 research methodology. Section 4 presents the experimental data of the proposed QAS. Section 5 discusses the experimental results of this research. Lastly, section 6 presents the conclusion and future works. II. RELATED WORKS This presents literature of queries analysis, sentence similarity matching, WordNet Ontology, and documents classifications using machine learning. A. Questions Analysis Using NLP The queries analyzed and evaluated based on many factors such as question type and keywords. Reference [8] mentioned that, there are two parts for question analysis which are; () concepts analysis by extract the main or important concepts of users queries, and () concepts processing through update the analyzed concepts to be compatible with formal representations of QA domains concepts. According to [9], the main NLP methods for this stage are as the following:. Normalization: Text normalization is the process of transforming text into a single canonical form that it might not have before. Normalizing text before storing or processing it allows for separation of concerns, since input is guaranteed to be consistent before operations are performed on it. Text normalization requires being aware of what type of text is to be normalized and how it is to be processed afterwards; there is no all-purpose normalization procedure. Text normalization is frequently used when converting text to speech. Numbers, dates, acronyms, and abbreviations are non-standard "words" that need to be pronounced differently depending on context. For examples, "$00" would be pronounced as "two hundred dollars" in English, but as "lua selau tālā" in Samoan and "vi" could be pronounced as "vie," "vee," or "the sixth" depending on the surrounding words.. Remove stop words to remove the un-important keywords such as 'and', 'the', and 'has'. Stop words are words that are filtered out before or after processing of natural language data (text) []. There is no single universal list of stop words used by all processing of natural language tools, and indeed not all tools even use such a list. Some tools specifically avoid removing these stop words to support phrase search. Any group of words can be chosen as the stop words for a given purpose. For some search engines, these are some of the most common, short function words, such as the, is, at, which, and on. Other search engines remove some of the most common words including lexical words, such as "want" from a query in order to improve performance. 3. Tokenization which divides the text sequence into sentences and then the sentences into tokens. So, In the English language, words are bounded by whitespace and optionally preceded and followed by parentheses, quotes, or punctuation marks. Therefore, the tokenization divides the character sequence based on the whitespace positions or other punctuation marks between words in the sentence. In addition, it cuts off the parentheses and punctuation marks to obtain the sequence of tokens. 4. N-gram which works on divides the sentences into words in query, where the N-gram algorithm focuses on calculating word by word, two words by two words and so on. Word gram is a contiguous sequence of -n items from a given sequence of text or speech. The items can be phonemes, syllables, letters, words or base pairs according to the application. The n-grams typically are collected from a text or speech corpus. An n-gram of size is referred to as a "unigram"; size is a "bigram" (or, less commonly, a "diagram"); size 3 is a "trigram". Larger sizes are sometimes referred to by the value of n, e.g., "four-gram", "five-gram", and so on [, ]. B. Similarity Measure This section discusses the most common techniques of sentence similarity measuring which are; Cosine similarity (CS) and Longest Common Subsequences (LCS). In QAS, CS and LCS techniques are used widely to measure the similarity between users queries and system documents []. Based on CS and LCS techniques increase the opportunity of extract accurate answers depend on users questions. i. Cosine Similarity (CS) Cosine similarity (CS) is a well-known vector based similarity measure in the fields of text mining and information retrieval. Basically, this measure is extensively employed to estimate the relationship between words, the strength of association between elements in two sets is determined by considering the cosine of the angle between two feature vectors. When two vectors are exactly the same, the angle between them is 0, and the cosine of the angle between them is ; when the vectors are orthogonal, the cosine value is 0. After obtaining the term weights (wij, wlj) of all words by, it's easy to apply cosine similarity to compute the similarity of two sentences. The cosine similarity between two sentences (si and sl) is defined as the following formula: sim cs(, ) = =, i, l=,,n. () where is the number of similar words between both sentences, the weights of words in si and sl sentences. ii. Longest Common Subsequence (LCS) is the total number of the Longest Common Subsequence (LCS) is a technique that applied on the problem of deriving patterns in a set of sequences, it is given as input a set of (related or partially related) sequences, and the goal is to find a set of patterns that are common to all or most of the sequences in the set. A good algorithm for this problem should output patterns that are of high sensitivity and specificity. Suppose there 543

3 are closely related terminologies for multiple sequences, namely Longest Common Subsequence (LCS). Given sequences S= s sm and T = t tn, S is the subsequence of T if for each j m, i < i< < im n, sj = j i t. Given a set of sequences S+ = {S, S,, Sk}, the LCS of S+ is the longest. Possible sequence T such that it is a subsequence of each and every sequence in S at the same time. Emphasizing the given set of sequences, their LCS and pattern are related; LCS represent different aspects of these sequences profile and it can all be used for sequence comparisons and analysis. C. Classification Based Machine Learning Since presenting all syntactic and semantic rules of a language to algorithm is a cumbersome task, for this reason different types of algorithms are made that can receive different examples and have the learning ability and can preview the user' expected response easily. Using machine learning, we can generate systems that includes thousands features of questions and do classification those questions automatically. This action increases the productivity rate of QAS []. Any text classification algorithm can be employed such SVM to classify the texts based on the purpose of information that included in this text [5, 6, ]. For example, the text that talk about places is referring to where questions and the text that talk about date and time refer to when questions and so on. i. Support Vector Machine (SVM) SVM is a useful technique for data classification and it is easier to be implemented than other classification methods such as Neural Networks. A classification task usually involves separating data into training and testing sets. Each instance in the training set contains one target value (i.e. the class labels) and several attributes (i.e. the features or observed variables). The goal of SVM is to produce a model (based on the training data) which predicts the target values of the test data given only the test data attributes. Given a training set of instance-label pairs (xi, yi), i =,..., l where xi Rn and y {, }, the support vector machines (SVM) [8] require the solution of the following optimization problem: Here training vectors xi are mapped into a higher (maybe infinite) dimensional space by the function φ. SVM finds a linear separating hyperplane with the maximal margin in this higher dimensional space. C > 0 is the penalty parameter of the error term. Furthermore, K (xi, xj) φ(xi) T φ(xj ) is called the kernel function. Though new kernels are being proposed by researchers, beginners may find in SVM books the following four basic kernels: Linear: K (xi, xj) = x T i xj. Polynomial: K (xi, xj) = (γxi T xj + r) d, γ > 0. Radial basis function (RBF): K (xi, xj) = exp( γkxi xjk ), γ > 0. Sigmoid: K (xi, xj) = tanh (γxi T xj + r). Here, γ, r, and d are kernel parameters. III. RESEARCH DOMAIN This research focuses on question answering system for Hadiths. The domain of this research is as the following: i. Hadiths Sources: Albukary documents of Hadiths is the main documents source of this research due to many reasons such as Albukary documents of Hadith considered as the most trust sources of Hadiths. These documents are standard references in the Islam world, and Albukhary source contain large volume of Hadiths in various subjects. Thus, it is required to classify and analyze the documents based users query efficiently. ii. Language of Hadiths documents: formal English translate of Hadiths is the language of Hadiths documents in this research. The non Arabic people face difficulty in provide effective concepts of Hadiths queries more than the Arabic people. iii. Hadiths Subjects: pray and fasting are the main two Hadiths subjects in this research. These subjects involve the Muslim daily activities. The other subjects such as Hajj, Zakat, and Al-shahadateen are accomplished by Muslims for one time in the life or time yearly. iv. Questions Types: this research focuses on classify users queries and Hadiths documents depend on two types of questions; () Where, and () When. These types of questions are related to places and time classes. Pray and fasting questions and documents are mostly about places and time. IV. METHOD The research method involves two important directions that effects on question answering system accuracy. Firstly, the users could not have the effective skills to provide their questions in right way. For example, the query typing of what is Malaysia capital? is better than typing give me cities in Malaysia. Therefore, the question answering system could address this challenge using NLP methods. Secondly, the document classifications according to queries types (i.e. When questions) using machine learning methods could improve the accuracy of provided answers. The NLP and machine learning methods selection of question answering system for Hadiths domain take into account many points based on question answering system aspects and research scope. These points are as the following: The selected domain: there are two main domains of question answering system which are open domains and close domains. The Hadiths domain considered as close domain because it is focuses on information based on same context rather than tourism (i.e. open domain) which 5443

4 focuses on many fields like weather and hotels bookings. However, the Hadith domain contains large information about various related fields e.g. Pray, Zakat, Fasting, Alshadteen, and Pilgrimage. Thus, this research focuses on two main fields which are Pray and fasting due to its relation with daily activities of people. The other fields like pilgrimage are done one time in the life. Type of users query: there are two main types of questions which are open and restricted questions. In this research the restricted type is adopted in order to provide more accurate answers. The restricted type of questions help the users to manage their query using define keywords such as what, when, where, and how questions. On other hand, the documents or answers can be managed according question type which increases the opportunity to retrieve accurate answers from large documents. Specifically, this research focuses on when and where questions due to nature of selected Hadith fields. Usually, the fasting subject is related to time (i.e. When ) and the Pray is related to time and places (i.e. where and when ). Architecture of question answering systems: the question answering systems involve two main directions in order to increase the opportunities of provide accurate answers; () query analysis, () and documents or answers management. The query analysis can accomplished effectively using many methods such as tokenization, stop-word removing, and N-gram. On the other side, one of most effective methods to classify the documents based on specific indicators (i.e. questions types) such SVM method. Another important process of question answering system is the concepts similarity which support the answers retrieving based on similarity measuring between query and system documents. Also, the similarity is important to enhance the users query supporting ontology that contains standard or right concepts of the related domain. The Cosine Similarity (CS) and Long Common Similarity (LCS) are the most common similarity measuring techniques in question answering systems and the WordNet is used widely for the purpose of extract and replace the weak concepts in queries by right concepts. Consequently, Fig. illustrates the methodology of this research according to selected methods of question answering system. The methodology can be described as three main phases which are preprocessing phase, similarity measuring phase, and classification phase. The corpus consists of Hadiths Documents related to pray and Fasting Subjects To classify the Hadiths documents according to proposed Subjects. To classify each documents according to question type i.e. places for where and time for when. Hadiths documents classified according to subjects and question type. Classification Phase SVM Hadiths Documents Classification Question Type Group Where When DB of indexing based on group Retrieve the Answers Features Preprocessing and Similarity Matching Phase Users queries PREPROCESSING Stop Word Removal Tokenization N-Gram Similarity Matching Figure. Research Methodology Figure 3.: Research Methodology CS LCS Measure the similarity between updated query and Hadiths Documents V. EXPERIMENTAL DATA Analyze question type through identifiers i.e. Mecca is place identifier which indicate where question. Remove the insignificant words such as and, the, is split the query as single tokens generate N-gram list depend on tokens number i.e. WordNet Enrich query concepts by formal Hadiths concepts using WordNet. The dataset of the proposed system consists of Hadiths documents about pray and fasting subjects. These documents were selected from Al-bukhari reference of true Hadiths. Al-bukhari considered as one of most trusted references of Hadiths documents due to strong procedures that followed by writer to assure the truth of spoken Hadiths by prophet Mohammad. Prof. Dr. Ahamad Shaker Mahmmod validated the selected Hadiths as proposed dataset as pray and fasting documents who work in Islamic college in Baghdad University, and he considered as expert in Hadiths due to his large experience years (more than 0 years) in the domain of sacred Hadiths. Table illustrates the selected Hadiths numbers according to subject. TABLE PROPOSED DATASET Subject Number of Hadiths Pray 8 Fasting 50 Total The number of Hadith documents that related to when question and pray subject is. The number of Hadith documents that related to where question and pray subject is. The total number of Hadiths that related to when and where questions connecting with pray subject is 8. The number of Hadith documents that related to when question and fasting subject is. The number of Hadith documents that related to where question and fasting subject is 0 (i.e. fasting subjects cannot be related with places). The total number of Hadiths that related to when and where questions connecting with fasting subject is. Table summarizes the Hadiths documents classifications. 5454

5 TABLE SUMMARY OF HADITHS CORPUS Type of Question Pray Fasting When Where 0 Not related to when or where Total 8 50 The proposed system test was conducted based on queries that were selected based on proposed questions about pray and fasting subjects that provided by 5 students from UKM universities according to discussion of proposed system objectives. Table 3 presents the proposed queries according to question and subject classifications. TABLE 3 DIRECTION OF TESTED QUIRES Query Subject Question Proposed Query Type Q Pray When When is the five time of pray for Muslims? Q Pray Where Where was the first Friday prayers? Q3 Fasting When When is the fasting month of Muslims Q4 Pray When When does the Muslims can pray for eid? Q5 Pray When When do you pray Maghrib? Q6 Pray Where Where was the first qibla of Muslims? Q7 Fasting When When does fasting begin? Q8 Pray Where When can be Muslims taraweeh prayers? Q9 Pray When When Should the Traveler Shorten the Prayer? Q Fasting When When does fasting end? Q Pray When When does the Muslim pray for God? Q Pray When When is time of al-fajr prayer? VI. Test #: Cosine Similarity EXPERIMENTAL DATA This test is conducted through using only the cosine similarity technique to measure the similarity between user query and answers documents. The accuracy results of proposed question answering system based on CS technique. The accuracy scores computed depend on the queries precision and recalls where recall = T/(T+(Hadith- T )), Precision= T/(T+(N.H-T)) and F_Score= (recall * precision)/ (recall + precision). The most accurate F_score (0.74) is belonging to third query (When is the fasting month of Muslims?) while the lowest F_score (0.55) is belong to 8th query (when can be Muslims Taraweeh prayers?). The accuracy results for the queries based on cosine similarity technique are; 0.7 for Q, 0.70 for Q, 0.74 for Q3, 0.6 for Q4, 0.73 for Q5, 0.6 for Q6, 0.56 for Q7, 0.55 for Q8, 0.60 for Q9, 0.7 for Q, 0.7 for Q, and 0.73 for Q. The average of answers accuracy of all tested queries record 67%. Thus, the accuracy results of cosine similarity technique considered acceptable, but it could be enhanced supporting other methods to provide answers that are more accurate. Test #: Longest Common Subsequence This test is conducted through using only the long common similarity technique to measure the similarity between user query and answers documents. The accuracy results of proposed question answering system based on LCS technique. The accuracy scores computed depend on the queries precision and recalls where recall = T/(T+(Hadith-T)), Precision= T/(T+(N.H-T)) and F_Score= (recall * precision)/ (recall + precision). The most accurate F_score () is belonging to nd query (where was the first Friday prayers?) while the lowest F_score (0.55) is belonging to 6 th query (When where was the first Qibla of Muslims?), and 9 th query (When Should the Traveler Shorten the Prayer?). The accuracy results for the queries based on long common similarity technique are; 0.67 for Q, for Q, 0.69 for Q3, 0.57 for Q4, 0.67 for Q5, 0.55 for Q6, 0.7 for Q7, 0.7 for Q8, 0.55 for Q9, 0.7 for Q, 0.75 for Q, and 0.57 for Q. The average of answers accuracy of all tested queries record 66%. It can be noticed that the average of accuracy results of CS and LCS are approximately same. Thus, these results could be enhanced supporting other methods to provide answers that are more accurate. Test #3: Combination of CS and LCS The combination between CS and LCS techniques was conducted through testing each proposed query using these two techniques and selects the better F-score of CS and LCS. For example, if F_score of first query using CS technique is higher than F_score of first query using LCS then the system will select F-Score of CS. The measurement of F-score and results selection was accomplished using proposed QA system. Thus, the combination results represent the best possible answers using similarities techniques. The accuracy results of proposed question answering system based on the combination of CS and LCS techniques. The most accurate F_score () is belonging to nd query (where was the first Friday prayers?) while the lowest F_score (0.60) is belonging to 9 th query (When Should the Traveler Shorten the Prayer?). The accuracy results for the queries based on CS and LCS combination are; 0.7 for Q, for Q, 0.74 for Q3, 0.6 for Q4, 0.73 for Q5, 0.6 for Q6, 0.7 for Q7, 0.7 for Q8, 0.60 for Q9, 0.7 for Q, 0.75 for Q, and 0.73 for Q. The average of F_score using combination of CS and LCS was recorded 70%. It can be noticed that combination of CS and LCS provide accurate

6 F_Score Recall Precision Hadith Query answers more than separate CS technique and LCS technique. Test #4: Combination of CS, LCS, and SVM This test is conducted based on two main steps; () classify the Hadiths documents using SVM method based on the questions types, and () measure the similarity through CS and LCS combination. SVM is used to classify Hadiths documents according to Hadiths subjects (Pray and Fasting), and proposed answers of questions types (Where and When). Then, the similarity between queries and documents was measured using the combination of CS and LCS techniques before calculate F-score of final extracted answers. Table 4 summarizes the accuracy results of proposed question answering system based on the combination of CS and LCS, in addition to classification based SVM. The accuracy scores computed depend on the queries precision and recalls where recall = T/(T+(Hadith-T )), Precision= T/(T+(N.H-T)) and F_Score= (recall * precision)/ (recall + precision). As noticed from Table 4, the average of F_score using combination of CS, LCS, and SVM was recorded 80%. The most accurate F_score (0.86) is belonging to nd query (where was the first friday prayers?) while the lowest F_score (0.73) is belonging to 8th and th queries. TABLE 4 ACCURACY MEASUREMENTS BASED ON CS, LCS, AND SVM Q Q Q3 Q4 Q5 Q6 Q7 Q8 Q9 Q Q Q Av Output system N.H 5 T 9 F VII. DISCUSSION ON THE FINDINGS According to findings of experimental results, the combination of CS, LCS, and SVM techniques record the highest accurate records of answers (80%) followed by the combination of CS and LCS techniques (70%), then CS technique (67%), and finally LCS technique (66%). SVM technique is plays important role to improve the accuracy results of proposed question answering system. The average accuracy results of all queries was improved by % when apply SVM with other techniques. On the other hand, most individual results of queries record accurate answers based on SVM with other techniques. SVM reduce the searching space of Hadiths documents through classify the Hadiths depend on proposed question types and documents subjects. The reducing of searching space increases the opportunities of retrieving true answers that match with users queries. This finding can be justified clearly through compare the results of SVM of pray and fasting subjects. the pray subject can classified as when and where question types documents but the fasting subject can be classified as when question type documents. the results of pray queries based on SVM is more accurate than Fasting queries due to possibility of minimize the searching space of pray subject more than the fasting subject. VIII. CONCLUSION There are many methods were applied to analyze the query needs of answers and update the query to be more effective based on the formal concepts of Hadiths. Preprocessing methods such as normalization, tokenization, stop-word removal, and N-gram were applied to analyze the concepts of users quires. the WordNet tool was applied to replace the weak concepts of queries by effective Hadiths concepts or synonyms. SVM technique was applied to reduce the searching space of answers and improve the possibility of retrieving accurate answers. SVM classify Hadiths documents as four main cluster which are; Pray documents for when question type, pray documents for where question type, fasting documents for when question type, and fasting documents for where question type. The results of experimental tests show that the proposed methods are effective to improve the accuracy of question answering system for Hadith domain. Significantly, SVM technique reduces the searching space of answers which improve the accuracy of provided answers. REFERENCES [] Xu-Dong Lin, Hong Peng, Bo Liu, Support Vector Machines for Text Categorization in Chinese Question Classification, College of Computer Science and Engineering, South China University of Technology, International Conference on Web Intelligence (WI 006 Main Conference Proceedings), IEEE, 006. [] Marcin Skowron, Kenji Araki, Evaluation of the New Feature Types for Question Classification with Support Vector Machines, Graduate School of Information Science and Technology Hokkaido University, Sapporo, , Japan, International Symposium on Communication and Information Technology ( ISCIT),

7 [3] Hakan Sundblad, Question Classification in Question Answering Systems, Thesis No. 0 ISSN , Department of Computer and Information Science Linkopings University, Linkoping, 007. [4] Dell Zhang, Wee Sun Lee, Question Classification using Support Vector Machines, National University of Singapore, Singapore- MIT Alliance, Toronto, Canada, 8-August, 003. [5] Harb.A, Michel Beigbeder, Jean-Jacques, Evaluation of Question Classification Systems Using Differing Features, Institute of Electrical and Electronics Engineers, 009. [6] Tan.W, Jianrong Cao, Hongyan Li, Algorithm of Shot Detection based on SVM with Modified Kernel Function, Shan Dong Jianzhu University, Jinan 50, China, International Conference on Artificial Intelligence and Computational Intelligence, IEEE, 009. [7] Gharehchopogh, Farhad Soleimanian, and Yaghoub Lotfi. "Machine Learning based Question Classification Methods in the Question Answering Systems."International Journal of Innovation and Applied Studies 4. (0): [8] Srihari, R. & Li,W. (000). Information extraction supported question answering, In Proceedings 8th Text Retrieval Conference (TREC-8), NIST Special Publication [9] Bhaskar.P, Pakray.P, Banerjee.S and Banerjee.S, 0, Question Answering System for QA4MRE, Department of Computer Science and Engineering, Jadavpur University, Kolkata, 70003, India. [] Ullman, Jeffrey D., Jure Leskovec, and Anand Rajaraman. "Mining of Massive Datasets." (0): [] Brants, T., Franz, A. 006." Web IT 5-gram Version ". ( upenn.edu/catalog/catalogentry.jsp?catalogid=ldc006t). [] Jian-fang, S., Zong-tian, L., & Jian-feng, F. 0. Event-network clustering using similarity. In Natural Computation (ICNC), 0 Sixth International Conference on (Vol. 8, pp ). IEEE. [] Madylova, A. & Oguducu, S A Taxonomy Based Semantic Similarity of Documents Using the Cosine Measure. Computer and Information Sciences, 009. ISCIS th International Symposium on, hlm [] Day.M, Chorng-Shyong Ong, Question Classification in English- Chinese Cross-Language Question Answering: An Integrated Genetic Algorithm and Machine Learning Approach, Institute of Information Science, Academia Sinica, Taiwan, Department of Information Management, National Taiwan University, Taiwan, IEEE, 007. [5] Molina-González, M. D., Martínez-Cámara, E., Martín-Valdivia, M.-T., & Perea-Ortega, J. M. (0). Semantic orientation for polarity classification in Spanish reviews. Expert Systems with Applications, 40(8), [6] Turney, P. D. (00). Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. Paper presented at the Proceedings of the 40th annual meeting on association for computational linguistics. [] Xu, T., Peng, Q., & Cheng, Y. (0). Identifying the semantic orientation of terms using S-HAL for sentiment analysis. Knowledge-Based Systems, 35, [8] Cortes, C., & Vapnik, V. (995). Support-vector networks. Machine learning,0(3),

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1 Question Answering CS486 / 686 University of Waterloo Lecture 23: April 1 st, 2014 CS486/686 Slides (c) 2014 P. Poupart 1 Question Answering Extension to search engines CS486/686 Slides (c) 2014 P. Poupart

More information

An Efficient Indexing Approach to Find Quranic Symbols in Large Texts

An Efficient Indexing Approach to Find Quranic Symbols in Large Texts Indian Journal of Science and Technology, Vol 7(10), 1643 1649, October 2014 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 An Efficient Indexing Approach to Find Quranic Symbols in Large Texts Vahid

More information

TEXT MINING TECHNIQUES RORY DUTHIE

TEXT MINING TECHNIQUES RORY DUTHIE TEXT MINING TECHNIQUES RORY DUTHIE OUTLINE Example text to extract information. Techniques which can be used to extract that information. Libraries How to measure accuracy. EXAMPLE TEXT Mr. Jack Ashley

More information

ECE 5984: Introduction to Machine Learning

ECE 5984: Introduction to Machine Learning ECE 5984: Introduction to Machine Learning Topics: SVM Multi-class SVMs Neural Networks Multi-layer Perceptron Readings: Barber 17.5, Murphy 16.5 Dhruv Batra Virginia Tech HW2 Graded Mean 66/61 = 108%

More information

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring)

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring) Information Extraction CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring) Information Extraction Automatically extract structure from text annotate document using tags to

More information

Intelligent Agent for Information Extraction from Arabic Text without Machine Translation

Intelligent Agent for Information Extraction from Arabic Text without Machine Translation Intelligent Agent for Information Extraction from Arabic Text without Machine Translation Tarek Helmy * Abdirahman Daud Information and Computer Science Department, College of Computer Science and Engineering,

More information

A Question Answering System on Holy Quran Translation Based on Question Expansion Technique and Neural Network Classification

A Question Answering System on Holy Quran Translation Based on Question Expansion Technique and Neural Network Classification Journal of Computer Sciences Original Research Paper A Question Answering System on Holy Quran Translation Based on Question Expansion Technique and Neural Network Classification Suhaib Kh. Hamed and Mohd

More information

NPTEL NPTEL ONINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture-59 Ensemble Methods- Bagging,Committee Machines and Stacking

NPTEL NPTEL ONINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture-59 Ensemble Methods- Bagging,Committee Machines and Stacking NPTEL NPTEL ONINE CERTIFICATION COURSE Introduction to Machine Learning Lecture-59 Ensemble Methods- Bagging,Committee Machines and Stacking Prof. Balaraman Ravindran Computer Science and Engineering Indian

More information

Universiti Teknologi MARA. Ontology of Social Interaction Ethics in Al Adab Al - Mufrad by Using Semantic Web

Universiti Teknologi MARA. Ontology of Social Interaction Ethics in Al Adab Al - Mufrad by Using Semantic Web Universiti Teknologi MARA Ontology of Social Interaction Ethics in Al Adab Al - Mufrad by Using Semantic Web Saidah Nafisah Binti Mazli Sham Thesis submitted in fulfilment of the requirements for Bachelor

More information

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith Halim Sayoud (&) USTHB University, Algiers, Algeria halim.sayoud@uni.de,

More information

MISSOURI S FRAMEWORK FOR CURRICULAR DEVELOPMENT IN MATH TOPIC I: PROBLEM SOLVING

MISSOURI S FRAMEWORK FOR CURRICULAR DEVELOPMENT IN MATH TOPIC I: PROBLEM SOLVING Prentice Hall Mathematics:,, 2004 Missouri s Framework for Curricular Development in Mathematics (Grades 9-12) TOPIC I: PROBLEM SOLVING 1. Problem-solving strategies such as organizing data, drawing a

More information

Using Machine Learning Algorithms for Categorizing Quranic Chapters by Major Phases of Prophet Mohammad s Messengership

Using Machine Learning Algorithms for Categorizing Quranic Chapters by Major Phases of Prophet Mohammad s Messengership Using Machine Learning Algorithms for Categorizing Quranic Chapters by Major Phases of Prophet Mohammad s Messengership Mohamadou Nassourou Department of Computer Philology & Modern German Literature University

More information

Identifying Anaphoric and Non- Anaphoric Noun Phrases to Improve Coreference Resolution

Identifying Anaphoric and Non- Anaphoric Noun Phrases to Improve Coreference Resolution Identifying Anaphoric and Non- Anaphoric Noun Phrases to Improve Coreference Resolution Vincent Ng Ng and Claire Cardie Department of of Computer Science Cornell University Plan for the Talk Noun phrase

More information

ECE 5424: Introduction to Machine Learning

ECE 5424: Introduction to Machine Learning ECE 5424: Introduction to Machine Learning Topics: SVM Multi-class SVMs Neural Networks Multi-layer Perceptron Readings: Barber 17.5, Murphy 16.5 Stefan Lee Virginia Tech HW2 Graded Mean 63/61 = 103% Max:

More information

Universiti Teknologi MARA. Zakat Calculation System for Academy of Contemporary Islamic Studies (ACIS), UiTM Melaka Campus Jasin

Universiti Teknologi MARA. Zakat Calculation System for Academy of Contemporary Islamic Studies (ACIS), UiTM Melaka Campus Jasin Universiti Teknologi MARA Zakat Calculation System for Academy of Contemporary Islamic Studies (ACIS), UiTM Melaka Campus Jasin Nurshafinas Binti Muhammad Thesis submitted in fulfilment of the requirements

More information

Prioritizing Issues in Islamic Economics and Finance

Prioritizing Issues in Islamic Economics and Finance Middle-East Journal of Scientific Research 15 (11): 1594-1598, 2013 ISSN 1990-9233 IDOSI Publications, 2013 DOI: 10.5829/idosi.mejsr.2013.15.11.11658 Prioritizing Issues in Islamic Economics and Finance

More information

Argument Harvesting Using Chatbots

Argument Harvesting Using Chatbots arxiv:1805.04253v1 [cs.ai] 11 May 2018 Argument Harvesting Using Chatbots Lisa A. CHALAGUINE a Fiona L. HAMILTON b Anthony HUNTER a Henry W. W. POTTS c a Department of Computer Science, University College

More information

The Impact of Oath Writing Style on Stylometric Features and Machine Learning Classifiers

The Impact of Oath Writing Style on Stylometric Features and Machine Learning Classifiers Journal of Computer Science Original Research Paper The Impact of Oath Writing Style on Stylometric Features and Machine Learning Classifiers 1 Ahmad Alqurnehand 2 Aida Mustapha 1 Faculty of Computer Science

More information

A Survey: Framework of an Information Retrieval for Malay Translated Hadith Document

A Survey: Framework of an Information Retrieval for Malay Translated Hadith Document A Survey: Framework of an Information Retrieval for Malay Translated Hadith Document Nurul Syeilla Syazhween Zulkefli 1,*, Nurazzah Abdul Rahman 1, and Mazidah Puteh 2 1 Faculty of Computer and Mathematical

More information

NPTEL NPTEL ONLINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture 31

NPTEL NPTEL ONLINE CERTIFICATION COURSE. Introduction to Machine Learning. Lecture 31 NPTEL NPTEL ONLINE CERTIFICATION COURSE Introduction to Machine Learning Lecture 31 Prof. Balaraman Ravindran Computer Science and Engineering Indian Institute of Technology Madras Hinge Loss Formulation

More information

Artificial Intelligence Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras

Artificial Intelligence Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras (Refer Slide Time: 00:26) Artificial Intelligence Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras Lecture - 06 State Space Search Intro So, today

More information

Extracting the Semantics of Understood-and- Pronounced of Qur anic Vocabularies Using a Text Mining Approach

Extracting the Semantics of Understood-and- Pronounced of Qur anic Vocabularies Using a Text Mining Approach Islamic University - Gaza Deanery of Graduate Studies Faculty of Information Technology الجامعة اإلسالمية غزة عمادة الد ارسات العميا كمية تكنولوجيا المعمومات Extracting the Semantics of Understood-and-

More information

McDougal Littell High School Math Program. correlated to. Oregon Mathematics Grade-Level Standards

McDougal Littell High School Math Program. correlated to. Oregon Mathematics Grade-Level Standards Math Program correlated to Grade-Level ( in regular (non-capitalized) font are eligible for inclusion on Oregon Statewide Assessment) CCG: NUMBERS - Understand numbers, ways of representing numbers, relationships

More information

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21 6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 21 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare

More information

USER AWARENESS ON THE AUTHENTICITY OF HADITH IN THE INTERNET: A CASE STUDY

USER AWARENESS ON THE AUTHENTICITY OF HADITH IN THE INTERNET: A CASE STUDY 1 USER AWARENESS ON THE AUTHENTICITY OF HADITH IN THE INTERNET: A CASE STUDY Nurul Nazariah Mohd Zaidi nazariahzaidi25@gmail.com Dr. Mesbahul Hoque Chowdhury mesbahul@usim.edu.my Faculty of Quranic and

More information

1. Introduction Formal deductive logic Overview

1. Introduction Formal deductive logic Overview 1. Introduction 1.1. Formal deductive logic 1.1.0. Overview In this course we will study reasoning, but we will study only certain aspects of reasoning and study them only from one perspective. The special

More information

The UPV at 2007

The UPV at 2007 The UPV at QA@CLEF 2007 Davide Buscaldi and Yassine Benajiba and Paolo Rosso and Emilio Sanchis Dpto. de Sistemas Informticos y Computación (DSIC), Universidad Politcnica de Valencia, Spain {dbuscaldi,

More information

ECE 5424: Introduction to Machine Learning

ECE 5424: Introduction to Machine Learning ECE 5424: Introduction to Machine Learning Topics: (Finish) Model selection Error decomposition Bias-Variance Tradeoff Classification: Naïve Bayes Readings: Barber 17.1, 17.2, 10.1-10.3 Stefan Lee Virginia

More information

StoryTown Reading/Language Arts Grade 2

StoryTown Reading/Language Arts Grade 2 Phonemic Awareness, Word Recognition and Fluency 1. Identify rhyming words with the same or different spelling patterns. 2. Read regularly spelled multi-syllable words by sight. 3. Blend phonemes (sounds)

More information

THE PROFIT EFFICIENCY: EVIDENCE FROM ISLAMIC BANKS IN INDONESIA

THE PROFIT EFFICIENCY: EVIDENCE FROM ISLAMIC BANKS IN INDONESIA THE PROFIT EFFICIENCY: EVIDENCE FROM ISLAMIC BANKS IN INDONESIA Muryani Arsal, Nik Intan Norhan bt Abdul Hamid Faculty of Management, ABSTRACT This study investigates the profit efficiency of Indonesia

More information

Prentice Hall Literature: Timeless Voices, Timeless Themes, Bronze Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 7)

Prentice Hall Literature: Timeless Voices, Timeless Themes, Bronze Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 7) Prentice Hall Literature: Timeless Voices, Timeless Themes, Bronze Level '2002 Oregon Language Arts Content Standards (Grade 7) ENGLISH READING: Comprehend a variety of printed materials. Recognize, pronounce,

More information

Order-Planning Neural Text Generation from Structured Data

Order-Planning Neural Text Generation from Structured Data Order-Planning Neural Text Generation from Structured Data Lei Sha, Lili Mou, Tianyu Liu, Pascal Poupart, Sujian Li, Baobao Chang, Zhifang Sui Institute of Computational Linguistics, Peking University

More information

Natural Language Processing (NLP) 10/30/02 CS470/670 NLP (10/30/02) 1

Natural Language Processing (NLP) 10/30/02 CS470/670 NLP (10/30/02) 1 Natural Language Processing (NLP) 10/30/02 CS470/670 NLP (10/30/02) 1 NLP Definition a range of computational techniques CS470/670 NLP (10/30/02) 2 NLP Definition (cont d) a range of computational techniques

More information

Information Retrieval LIS 544 IMT 542 INSC 544

Information Retrieval LIS 544 IMT 542 INSC 544 Information Retrieval LIS 544 IMT 542 INSC 544 Welcome! Your instructors Jeff Huang lazyjeff@uw.edu Shawn Walker stw3@uw.edu Introductions Name Program, year Previous school(s) Most interesting thing you

More information

Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 3 Correlated with Common Core State Standards, Grade 3

Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 3 Correlated with Common Core State Standards, Grade 3 Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 3 Common Core State Standards for Literacy in History/Social Studies, Science, and Technical Subjects, Grades K-5 English Language Arts Standards»

More information

Analyzing the activities of visitors of the Leiden Ranking website

Analyzing the activities of visitors of the Leiden Ranking website Analyzing the activities of visitors of the Leiden Ranking website Nees Jan van Eck and Ludo Waltman Centre for Science and Technology Studies, Leiden University, The Netherlands {ecknjpvan, waltmanlr}@cwts.leidenuniv.nl

More information

Ms. Shruti Aggarwal Assistant Professor S.G.G.S.W.U. Fatehgarh Sahib

Ms. Shruti Aggarwal Assistant Professor S.G.G.S.W.U. Fatehgarh Sahib Ms. Shruti Aggarwal S.G.G.S.W.U. Fatehgarh Sahib Email: shruti_cse@sggswu.org Area of Specialization: Data Mining, Software Engineering, Databases Subjects Taught Languages Fundamentals of Computers, C,

More information

Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes

Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes Winning on the Merits: The Joint Effects of Content and Style on Debate Outcomes Lu Wang 1, Nick Beauchamp 2,3, Sarah Shugars 3, Kechen Qin 1 1 College of Computer and Information Science 2 Department

More information

Gesture recognition with Kinect. Joakim Larsson

Gesture recognition with Kinect. Joakim Larsson Gesture recognition with Kinect Joakim Larsson Outline Task description Kinect description AdaBoost Building a database Evaluation Task Description The task was to implement gesture detection for some

More information

South Carolina English Language Arts / Houghton Mifflin English Grade Three

South Carolina English Language Arts / Houghton Mifflin English Grade Three Reading Goal (R) The student will draw upon a variety of strategies to comprehend, interpret, analyze, and evaluate what he or she reads. READING PROCESS AND COMPREHENSION 3-R1 The student will integrate

More information

Grade 6 correlated to Illinois Learning Standards for Mathematics

Grade 6 correlated to Illinois Learning Standards for Mathematics STATE Goal 6: Demonstrate and apply a knowledge and sense of numbers, including numeration and operations (addition, subtraction, multiplication, division), patterns, ratios and proportions. A. Demonstrate

More information

Anaphora Resolution in Hindi Language

Anaphora Resolution in Hindi Language International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 609-616 International Research Publications House http://www. irphouse.com /ijict.htm Anaphora

More information

Perception of Individual Consumers toward Islamic Banking Products and Services in Pakistan

Perception of Individual Consumers toward Islamic Banking Products and Services in Pakistan Vol.5 14 Perception of Individual Consumers toward Islamic Banking Products and Services in Pakistan Amer Sohail Assistant Professor Department of Management Sciences, University of Sargodha,Gujranwala

More information

Anaphora Resolution in Biomedical Literature: A Hybrid Approach

Anaphora Resolution in Biomedical Literature: A Hybrid Approach Anaphora Resolution in Biomedical Literature: A Hybrid Approach Jennifer D Souza and Vincent Ng Human Language Technology Research Institute University of Texas at Dallas Richardson, TX 75083-0688 {jld082000,vince}@hlt.utdallas.edu

More information

Studying Adaptive Learning Efficacy using Propensity Score Matching

Studying Adaptive Learning Efficacy using Propensity Score Matching Studying Adaptive Learning Efficacy using Propensity Score Matching Shirin Mojarad 1, Alfred Essa 1, Shahin Mojarad 1, Ryan S. Baker 2 McGraw-Hill Education 1, University of Pennsylvania 2 {shirin.mojarad,

More information

Agnostic KWIK learning and efficient approximate reinforcement learning

Agnostic KWIK learning and efficient approximate reinforcement learning Agnostic KWIK learning and efficient approximate reinforcement learning István Szita Csaba Szepesvári Department of Computing Science University of Alberta Annual Conference on Learning Theory, 2011 Szityu

More information

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 8)

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 8) Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level '2002 Oregon Language Arts Content Standards (Grade 8) ENGLISH READING: Comprehend a variety of printed materials. Recognize, pronounce,

More information

A New Parameter for Maintaining Consistency in an Agent's Knowledge Base Using Truth Maintenance System

A New Parameter for Maintaining Consistency in an Agent's Knowledge Base Using Truth Maintenance System A New Parameter for Maintaining Consistency in an Agent's Knowledge Base Using Truth Maintenance System Qutaibah Althebyan, Henry Hexmoor Department of Computer Science and Computer Engineering University

More information

The SAT Essay: An Argument-Centered Strategy

The SAT Essay: An Argument-Centered Strategy The SAT Essay: An Argument-Centered Strategy Overview Taking an argument-centered approach to preparing for and to writing the SAT Essay may seem like a no-brainer. After all, the prompt, which is always

More information

Anaphora Resolution in Biomedical Literature: A

Anaphora Resolution in Biomedical Literature: A Anaphora Resolution in Biomedical Literature: A Hybrid Approach Jennifer D Souza and Vincent Ng Human Language Technology Research Institute The University of Texas at Dallas 1 What is Anaphora Resolution?

More information

Georgia Quality Core Curriculum

Georgia Quality Core Curriculum correlated to the Grade 8 Georgia Quality Core Curriculum McDougal Littell 3/2000 Objective (Cite Numbers) M.8.1 Component Strand/Course Content Standard All Strands: Problem Solving; Algebra; Computation

More information

STI 2018 Conference Proceedings

STI 2018 Conference Proceedings STI 2018 Conference Proceedings Proceedings of the 23rd International Conference on Science and Technology Indicators All papers published in this conference proceedings have been peer reviewed through

More information

Automatic Recognition of Tibetan Buddhist Text by Computer. Masami Kojima*1, Yoshiyuki Kawazoe*2 and Masayuki Kimura*3

Automatic Recognition of Tibetan Buddhist Text by Computer. Masami Kojima*1, Yoshiyuki Kawazoe*2 and Masayuki Kimura*3 Automatic Recognition of Tibetan Buddhist Text by Computer Masami Kojima*1, Yoshiyuki Kawazoe*2 and Masayuki Kimura*3 *1 Dept. of Electrical Communication, Tohoku Institute of Technology ( E-mail : mkojima@tohtech.ac.jp

More information

Deep Neural Networks [GBC] Chap. 6, 7, 8. CS 486/686 University of Waterloo Lecture 18: June 28, 2017

Deep Neural Networks [GBC] Chap. 6, 7, 8. CS 486/686 University of Waterloo Lecture 18: June 28, 2017 Deep Neural Networks [GBC] Chap. 6, 7, 8 CS 486/686 University of Waterloo Lecture 18: June 28, 2017 Outline Deep Neural Networks Gradient Vanishing Rectified linear units Overfitting Dropout Breakthroughs

More information

AUTHORSHIP DISCRIMINATION ON QURAN AND HADITH USING DISCRIMINATIVE LEAVE-ONE-OUT CLASSIFICATION

AUTHORSHIP DISCRIMINATION ON QURAN AND HADITH USING DISCRIMINATIVE LEAVE-ONE-OUT CLASSIFICATION AUTHORSHIP DISCRIMIATIO O QURA AD HADITH USIG DISCRIMIATIVE LEAVE-OE-OUT CLASSIFICATIO Halim Sayoud http://sayoud.net USTHB University halim.sayoud@uni.de ABSTRACT In this survey, we try to make an investigation

More information

Keyword based Clustering Technique for Collections of Hadith Chapters

Keyword based Clustering Technique for Collections of Hadith Chapters Keyword based Clustering Technique for Collections of Hadith Chapters Puteri N. E, Nohuddin 1, a, Zuraini Zainol 2, b, Kuan Fook Chao 2, c, A. Imran Nordin 1, d, and M. Tarhamizwan A. H. James 2, e 1 Institute

More information

Network Analysis of the Four Gospels and the Catechism of the Catholic Church

Network Analysis of the Four Gospels and the Catechism of the Catholic Church Network Analysis of the Four Gospels and the Catechism of the Catholic Church Hajime Murai and Akifumi Tokosumi Department of Value and Decision Science, Tokyo Institute of Technology 2-12-1, Ookayama,

More information

The Meaning of Muslim-Friendly Destination: Perspective of Malaysian and Korean Scholars

The Meaning of Muslim-Friendly Destination: Perspective of Malaysian and Korean Scholars ISBN 978-93-84422-37-0 11th International Conference on Arts, Humanities, Social Sciences and Corporate Social Responsibilities (AHSCSR-17) Dec. 25-26, 2017 Bangkok (Thailand) The Meaning of Muslim-Friendly

More information

That's Your Evidence?: Using Mechanical Turk To Develop A Computational Account Of Debate And Argumentation In Online Forums

That's Your Evidence?: Using Mechanical Turk To Develop A Computational Account Of Debate And Argumentation In Online Forums That's Your Evidence?: Using Mechanical Turk To Develop A Computational Account Of Debate And Argumentation In Online Forums Natural Language and Dialogue Systems Lab Prof. Marilyn Walker Debate and Deliberation:

More information

Sentiment Flow! A General Model of Web Review Argumentation

Sentiment Flow! A General Model of Web Review Argumentation Sentiment Flow! A General Model of Web Review Argumentation Henning Wachsmuth, Johannes Kiesel, Benno Stein henning.wachsmuth@uni-weimar.de www.webis.de! Web reviews across domains This book was different.

More information

Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 4 Correlated with Common Core State Standards, Grade 4

Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 4 Correlated with Common Core State Standards, Grade 4 Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 4 Common Core State Standards for Literacy in History/Social Studies, Science, and Technical Subjects, Grades K-5 English Language Arts Standards»

More information

Houghton Mifflin MATHEMATICS

Houghton Mifflin MATHEMATICS 2002 for Mathematics Assessment NUMBER/COMPUTATION Concepts Students will describe properties of, give examples of, and apply to real-world or mathematical situations: MA-E-1.1.1 Whole numbers (0 to 100,000,000),

More information

Reference Resolution. Regina Barzilay. February 23, 2004

Reference Resolution. Regina Barzilay. February 23, 2004 Reference Resolution Regina Barzilay February 23, 2004 Announcements 3/3 first part of the projects Example topics Segmentation Identification of discourse structure Summarization Anaphora resolution Cue

More information

Reference Resolution. Announcements. Last Time. 3/3 first part of the projects Example topics

Reference Resolution. Announcements. Last Time. 3/3 first part of the projects Example topics Announcements Last Time 3/3 first part of the projects Example topics Segmentation Symbolic Multi-Strategy Anaphora Resolution (Lappin&Leass, 1994) Identification of discourse structure Summarization Anaphora

More information

Artificial Intelligence: Valid Arguments and Proof Systems. Prof. Deepak Khemani. Department of Computer Science and Engineering

Artificial Intelligence: Valid Arguments and Proof Systems. Prof. Deepak Khemani. Department of Computer Science and Engineering Artificial Intelligence: Valid Arguments and Proof Systems Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras Module 02 Lecture - 03 So in the last

More information

Reductio ad Absurdum, Modulation, and Logical Forms. Miguel López-Astorga 1

Reductio ad Absurdum, Modulation, and Logical Forms. Miguel López-Astorga 1 International Journal of Philosophy and Theology June 25, Vol. 3, No., pp. 59-65 ISSN: 2333-575 (Print), 2333-5769 (Online) Copyright The Author(s). All Rights Reserved. Published by American Research

More information

In the name of Allah, the Beneficent and Merciful S/5/100 report 1/12/1982 [December 1, 1982] Towards a worldwide strategy for Islamic policy (Points

In the name of Allah, the Beneficent and Merciful S/5/100 report 1/12/1982 [December 1, 1982] Towards a worldwide strategy for Islamic policy (Points In the name of Allah, the Beneficent and Merciful S/5/100 report 1/12/1982 [December 1, 1982] Towards a worldwide strategy for Islamic policy (Points of Departure, Elements, Procedures and Missions) This

More information

A Quranic Quote Verification Algorithm for Verses Authentication

A Quranic Quote Verification Algorithm for Verses Authentication 2012 International Conference on Innovations in Information Technology (IIT) A Quranic Quote Verification Algorithm for Verses Authentication Abdulrhman Alshareef 1,2, Abdulmotaleb El Saddik 1 1 Multimedia

More information

UNIVERSITI TEKNOLOGI MARA AN EXPLORATORY OF CONCEPTUAL MODEL OF POVERTY INFORMATION CROWDSOURCING FOR ZAKAT DISTRIBUTION UMMU FATIH AH BT MOHD BAHRIN

UNIVERSITI TEKNOLOGI MARA AN EXPLORATORY OF CONCEPTUAL MODEL OF POVERTY INFORMATION CROWDSOURCING FOR ZAKAT DISTRIBUTION UMMU FATIH AH BT MOHD BAHRIN UNIVERSITI TEKNOLOGI MARA AN EXPLORATORY OF CONCEPTUAL MODEL OF POVERTY INFORMATION CROWDSOURCING FOR ZAKAT DISTRIBUTION UMMU FATIH AH BT MOHD BAHRIN Report submitted in partial fulfillment of the requirements

More information

Artificial Intelligence Prof. P. Dasgupta Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur

Artificial Intelligence Prof. P. Dasgupta Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Artificial Intelligence Prof. P. Dasgupta Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture- 9 First Order Logic In the last class, we had seen we have studied

More information

COACHING THE BASICS: WHAT IS AN ARGUMENT?

COACHING THE BASICS: WHAT IS AN ARGUMENT? COACHING THE BASICS: WHAT IS AN ARGUMENT? Some people think that engaging in argument means being mad at someone. That s one use of the word argument. In debate we use a far different meaning of the term.

More information

TÜ Information Retrieval

TÜ Information Retrieval TÜ Information Retrieval Übung 2 Heike Adel, Sascha Rothe Center for Information and Language Processing, University of Munich May 8, 2014 1 / 17 Problem 1 Assume that machines in MapReduce have 100GB

More information

Outline of today s lecture

Outline of today s lecture Outline of today s lecture Putting sentences together (in text). Coherence Anaphora (pronouns etc) Algorithms for anaphora resolution Document structure and discourse structure Most types of document are

More information

Saint Bartholomew School Third Grade Curriculum Guide. Language Arts. Writing

Saint Bartholomew School Third Grade Curriculum Guide. Language Arts. Writing Language Arts Reading (Literature) Locate and respond to key details Determine the message or moral in a folktale, fable, or myth Describe the qualities and actions of a character Differentiate between

More information

China Buddhism Encyclopedia Online Website Project.

China Buddhism Encyclopedia Online Website Project. China Buddhism Encyclopedia Online Website Project Www.chinabuddhismencyclopedia.com About CBE Author and main coordinator of the project Vello Vaartnou Project launched in December 2012 Project is developed

More information

Buddha Images in Mudras Representing Days of a Week: Tactile Texture Design for the Blind

Buddha Images in Mudras Representing Days of a Week: Tactile Texture Design for the Blind Buddha Images in Mudras Representing Days of a Week: Tactile Texture Design for the Blind Chantana Insra Abstract The research Buddha Images in Mudras Representing Days of a Week: Tactile Texture Design

More information

Transcription ICANN London IDN Variants Saturday 21 June 2014

Transcription ICANN London IDN Variants Saturday 21 June 2014 Transcription ICANN London IDN Variants Saturday 21 June 2014 Note: The following is the output of transcribing from an audio. Although the transcription is largely accurate, in some cases it is incomplete

More information

Tuen Mun Ling Liang Church

Tuen Mun Ling Liang Church NCD insights Quality Characteristic ti Analysis & Trends for the Natural Church Development Journey of Tuen Mun Ling Liang Church January-213 Pastor for 27 years: Mok Hing Wan "Service attendance" "Our

More information

Measuring religious intolerance across Indonesian provinces

Measuring religious intolerance across Indonesian provinces Measuring religious intolerance across Indonesian provinces How do Indonesian provinces vary in the levels of religious tolerance among their Muslim populations? Which province is the most tolerant and

More information

Our Story with MCM. Shanghai Jiao Tong University. March, 2014

Our Story with MCM. Shanghai Jiao Tong University. March, 2014 Our Story with MCM Libin Wen, Jingyuan Wu and Cong Wang Shanghai Jiao Tong University March, 2014 1 Introduction to Our Group Be It Known That The Team Of With Faculty Advisor Of Was Designated As Administered

More information

THE SEVENTH-DAY ADVENTIST CHURCH AN ANALYSIS OF STRENGTHS, WEAKNESSES, OPPORTUNITIES, AND THREATS (SWOT) Roger L. Dudley

THE SEVENTH-DAY ADVENTIST CHURCH AN ANALYSIS OF STRENGTHS, WEAKNESSES, OPPORTUNITIES, AND THREATS (SWOT) Roger L. Dudley THE SEVENTH-DAY ADVENTIST CHURCH AN ANALYSIS OF STRENGTHS, WEAKNESSES, OPPORTUNITIES, AND THREATS (SWOT) Roger L. Dudley The Strategic Planning Committee of the General Conference of Seventh-day Adventists

More information

Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases

Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases Naoya Inoue,RyuIida, Kentaro Inui and Yuji Matsumoto An anaphoric relation can be either direct or indirect. In some cases, the

More information

correlated to the Massachussetts Learning Standards for Geometry C14

correlated to the Massachussetts Learning Standards for Geometry C14 correlated to the Massachussetts Learning Standards for Geometry C14 12/2003 2004 McDougal Littell Geometry 2004 correlated to the Massachussetts Learning Standards for Geometry Note: The parentheses at

More information

REQUIRED DOCUMENT FROM HIRING UNIT

REQUIRED DOCUMENT FROM HIRING UNIT Terms of reference GENERAL INFORMATION Title: Consultant for Writing on the Proposal of Zakat Trust Fund (International Consultant) Project Name: Social and Islamic Finance Reports to: Deputy Country Director,

More information

All They Know: A Study in Multi-Agent Autoepistemic Reasoning

All They Know: A Study in Multi-Agent Autoepistemic Reasoning All They Know: A Study in Multi-Agent Autoepistemic Reasoning PRELIMINARY REPORT Gerhard Lakemeyer Institute of Computer Science III University of Bonn Romerstr. 164 5300 Bonn 1, Germany gerhard@cs.uni-bonn.de

More information

This report is organized in four sections. The first section discusses the sample design. The next

This report is organized in four sections. The first section discusses the sample design. The next 2 This report is organized in four sections. The first section discusses the sample design. The next section describes data collection and fielding. The final two sections address weighting procedures

More information

MissionInsite Learning Series Compare Your Congregation To Your Community Slide 1 COMPARE YOUR CONGREGATION TO YOUR COMMUNITY USING CONGREGANT PLOT & THE COMPARATIVEINSITE REPORT This Series will cover:

More information

Inimitable Human Intelligence and The Truth on Morality. to life, such as 3D projectors and flying cars. In fairy tales, magical spells are cast to

Inimitable Human Intelligence and The Truth on Morality. to life, such as 3D projectors and flying cars. In fairy tales, magical spells are cast to 1 Inimitable Human Intelligence and The Truth on Morality Less than two decades ago, Hollywood films brought unimaginable modern creations to life, such as 3D projectors and flying cars. In fairy tales,

More information

APAS assistant flexible production assistant

APAS assistant flexible production assistant APAS assistant flexible production assistant 2 I APAS assistant APAS assistant I 3 Flexible automation for the smart factory of the future APAS family your partner on the path to tomorrow s production

More information

SYLLABUS. Department Syllabus. Philosophy of Religion

SYLLABUS. Department Syllabus. Philosophy of Religion SYLLABUS DATE OF LAST REVIEW: 02/2013 CIP CODE: 24.0101 SEMESTER: COURSE TITLE: Department Syllabus Philosophy of Religion COURSE NUMBER: PHIL 200 CREDIT HOURS: 3 INSTRUCTOR: OFFICE LOCATION: OFFICE HOURS:

More information

South Carolina English Language Arts / Houghton Mifflin Reading 2005 Grade Three

South Carolina English Language Arts / Houghton Mifflin Reading 2005 Grade Three Reading Goal (R) The student will draw upon a variety of strategies to comprehend, interpret, analyze, and evaluate what he or she reads. READING PROCESS AND COMPREHENSION 3-R1 The student will integrate

More information

Tools Andrew Black CS 305 1

Tools Andrew Black CS 305 1 Tools Andrew Black CS 305 1 Critical Thinking Everyone thinks, all the time Why Critical Thinking? Much of our thinking is biased, distorted, partial, uninformed, or down-right prejudiced. This costs us

More information

Introduction. I. Proof of the Minor Premise ( All reality is completely intelligible )

Introduction. I. Proof of the Minor Premise ( All reality is completely intelligible ) Philosophical Proof of God: Derived from Principles in Bernard Lonergan s Insight May 2014 Robert J. Spitzer, S.J., Ph.D. Magis Center of Reason and Faith Lonergan s proof may be stated as follows: Introduction

More information

Pearson myworld Geography Western Hemisphere 2011

Pearson myworld Geography Western Hemisphere 2011 A Correlation of Pearson Western Hemisphere 2011 Table of Contents Reading Standards for... 3 Writing Standards for... 9 A Correlation of, Reading Standards for Key Ideas and Details RH.6-8.1. Cite specific

More information

INF5020 Philosophy of Information: Ontology

INF5020 Philosophy of Information: Ontology WEEK 3, LECTURE a INF5020 Philosophy of Information: Ontology M. Naci Akkøk, Fall 2004 Page 1 THIS SESSION The goal History: We first talked about computation, complexity and looked at several definitions

More information

Prentice Hall U.S. History Modern America 2013

Prentice Hall U.S. History Modern America 2013 A Correlation of Prentice Hall U.S. History 2013 A Correlation of, 2013 Table of Contents Grades 9-10 Reading Standards for... 3 Writing Standards for... 9 Grades 11-12 Reading Standards for... 15 Writing

More information

Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur. Lecture No. # 18 Acceptance Sampling

Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur. Lecture No. # 18 Acceptance Sampling Six Sigma Prof. Dr. T. P. Bagchi Department of Management Indian Institute of Technology, Kharagpur Lecture No. # 18 Acceptance Sampling Good afternoon, we begin today we continue with our session on Six

More information

How many imputations do you need? A two stage calculation using a quadratic rule

How many imputations do you need? A two stage calculation using a quadratic rule Sociological Methods and Research, in press 2018 How many imputations do you need? A two stage calculation using a quadratic rule Paul T. von Hippel University of Texas, Austin Abstract 0F When using multiple

More information

Proceedings of the Meeting & workshop on Development of a National IT Strategy Focusing on Indigenous Content Development

Proceedings of the Meeting & workshop on Development of a National IT Strategy Focusing on Indigenous Content Development Ministry of Science, Research & Technology Iranian Information & Documentation Center (Research Center) Proceedings of the Meeting & workshop on Development of a National IT Strategy Focusing on Indigenous

More information