09 Feb, 2009, David Haley wrote in the 21st comment:
Votes: 0
Silenus said:
Progress I suspect has been made in this area, though I was looking at that Blackburn book and it seems like building a system which is close to the state of the art (the book is dated 2005) isn't too tricky

Frankly, it is often my impression that these things look simpler in books than they are when you actually go to implement them. :wink:

Silenus said:
But in either case Prolog is probably not a good choice for handling this sort of thing, except for toy example systems.

Actually, for what it does, Prolog is very fast. If you were to implement such a prover yourself, you'd very likely end up reimplementing a lot of what the Warren abstract machine does. I've worked in this field (logic) quite a bit, actually; my MS was in AI with a focus on logic, so implementation obviously came up alongside various theoretical concerns.

In a particular class I took and taught, programs with their own logic engines (including mine, FWIW) were invariably slower than programs that embedded Prolog.

But you are correct that Prolog is not a full FOL theorem prover. It's not really that its semantics are theoretically simpler, they're just different. (Herbrand vs. non-Herbrand interpretations, etc.)

A full FOL theorem prover is, however, likely to be far slower than Prolog. :shrug: It really comes down to what you need it to do. You might not have a choice in the matter and need full FOL. But as I said, in practice a lot of people in the field use Prolog or something with similar semantics.
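To make the difference concrete, here's a minimal sketch (predicates made up, of course) of what Horn clauses can and cannot say directly:

% Prolog clauses have at most one positive literal (Horn clauses),
% so plain facts and implications are fine:
person(alice).
mortal(X) :- person(X).

% But a disjunctive conclusion such as
%   forall X. person(X) -> male(X) v female(X)
% has no direct Horn-clause encoding. A full FOL prover can reason
% with it; Prolog simply cannot state it as a clause.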

quixadhal said:
In any case, I think I'd want to use a language that was designed to work with strings, not C.

Well, once you do the parsing, the strings are basically incidental, as you are dealing with logical structures, not strings.
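For instance, a toy DCG (grammar and vocabulary invented purely for illustration) goes straight from a word list to a term, and from then on you never touch the string again:

sentence(loves(X, Y)) --> np(X), [loves], np(Y).
np(john) --> [john].
np(mary) --> [mary].

% ?- phrase(sentence(S), [john, loves, mary]).
% S = loves(john, mary).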

Silenus said:
As for the representation of what you glean from the sentences you use as input, you really do need first-order logic, I gather, which is a superset of Codd's 8 operators for relational algebra - so an RDBMS probably does not cope well with this.

It's not just that it wouldn't cope well – it wouldn't be able to solve the problem in the first place. :wink:
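The classic illustration of the gap is transitive closure: Codd's original operators can't express it at all, while it's two lines of recursion in Prolog (relation names made up):

parent(tom, bob).
parent(bob, ann).

ancestor(X, Y) :- parent(X, Y).
ancestor(X, Y) :- parent(X, Z), ancestor(Z, Y).

% ?- ancestor(tom, ann).
% true.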
09 Feb, 2009, David Haley wrote in the 22nd comment:
Votes: 0
aidil said:
The natural language parsing part could easily be done by DGD's string parser, which is often used for command parsers but is a generic LR parser that can do many things (the DGD network package contains a grammar that uses the string parser to decode mudmode data for Intermud-3, for example, and I have used it for other network protocols as well), and it happens to be good at NLP.

There's a big, big difference between parsing commands (or even splitting strings into parts and parts of speech) and doing any kind of semantic analysis on the results of your parsing. "NLP" is a lot more than just parsing things.
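A quick example of why: here's a toy Prolog DCG (vocabulary invented) where the parse succeeds twice, and only semantics can pick the right reading:

s(s(NP, VP)) --> np(NP), vp(VP).
np(np(i)) --> [i].
np(np(the, N)) --> [the], n(N).
np(np(the, N, PP)) --> [the], n(N), pp(PP).
vp(vp(V, NP)) --> v(V), np(NP).
vp(vp(V, NP, PP)) --> v(V), np(NP), pp(PP).
pp(pp(with, NP)) --> [with], np(NP).
n(man) --> [man].
n(telescope) --> [telescope].
v(saw) --> [saw].

% ?- phrase(s(T), [i, saw, the, man, with, the, telescope]).
% Two parse trees: the telescope modifies "man" in one and "saw"
% in the other. The parser alone cannot tell you which was meant.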
09 Feb, 2009, Silenus wrote in the 23rd comment:
Votes: 0
As I said, I have only read a few books on the subject, including Blackburn's, so take all this with a grain of salt:

Quote
Frankly, it is often my impression that these things look simpler in books than they are when you actually go to implement them. :wink:


Always :tongue:. More seriously though, I think this generally depends on how clearly the material is presented and how laborious the work is. Looking over what was presented in Blackburn, the parsing -> FOL step was very clearly presented, and it appeared you could almost cut and paste the algorithms into some language and get them to work. It seemed about as straightforward as writing the compiler-to-IL layer which I am working on, on and off, atm. The book actually piqued my interest enough that I might actually try doing this - but it seems like without a good lexicon I am sort of stuck.
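Something like this little DCG gives the flavor (my own toy encoding, nothing like as careful as Blackburn's lambda-based machinery):

s(F) --> np(X, Scope, F), vp(X, Scope).
np(X, Scope, forall(X, imp(Restr, Scope))) --> [every], n(X, Restr).
np(X, Scope, exists(X, and(Restr, Scope))) --> [a], n(X, Restr).
n(X, man(X)) --> [man].
n(X, woman(X)) --> [woman].
vp(X, walks(X)) --> [walks].

% ?- phrase(s(F), [every, man, walks]).
% F = forall(X, imp(man(X), walks(X)))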

Quote
But you are correct that Prolog is not a full FOL theorem prover. It's not really that its semantics are theoretically simpler, they're just different. (Herbrand vs. non-Herbrand interpretations, etc.)


Hmm, I never heard this before. I always heard that Prolog was the result of some compromises to get a computationally tractable "subset" which is decidable, with negation as finite failure, procedural constructs like cuts, etc.

Quote
A full FOL theorem prover is, however, likely to be far slower than Prolog. :shrug: It really comes down to what you need it to do. You might not have a choice in the matter and need full FOL. But as I said, in practice a lot of people in the field use Prolog or something with similar semantics.


Well, Blackburn doesn't seem to like it, IIRC, since for certain English sentences, in the representations he describes, it is just unfortunately inadequate. He suggests using an open-source theorem prover to do this job.

Quote
It's not just that it wouldn't cope well – it wouldn't be able to solve the problem in the first place. :wink:


Well, you might be able to coerce it to, using something like a combination of C + PL/SQL or some such, but I speculate it would be painful and quite slow. Representing term-like structures in tables would, I think, be quite a nightmare. Better to rely on the latest term-indexing techniques for that.
09 Feb, 2009, aidil wrote in the 24th comment:
Votes: 0
Quote
There's a big, big difference between parsing commands (or even splitting strings into parts and parts of speech) and doing any kind of semantic analysis on the results of your parsing. "NLP" is a lot more than just parsing things.


The first step is still to have a parser that can deal with the typical structure and ambiguities of natural language. Beyond that, parse_string can do a bit more than split a string into parts, by using production rules that can call native logic (LPC functions) to manipulate the data (often used in 'simple' cases to directly interpret fragments, i.e., assign 'meaning' to them).

It's far from a complete canned solution, and nothing that doesn't exist in other forms as well, but it does address the initial parsing phase of NLP very well in a MUD environment.
09 Feb, 2009, David Haley wrote in the 25th comment:
Votes: 0
Silenus said:
It seemed about as straightforward as writing the compiler-to-IL layer which I am working on, on and off, atm.

I'm going to assume you're talking about relative straightforwardness again, because we just had a whole discussion about why that isn't a simple problem. :tongue:
But Blackburn's book is a pretty good one. We probably read the same one, although I can't remember the title.

Silenus said:
Hmm, I never heard this before. I always heard that Prolog was the result of some compromises to get a computationally tractable "subset" which is decidable, with negation as finite failure, procedural constructs like cuts, etc.

Well, it's not really a subset; that's what I meant. I don't really mean to get into a theoretical logic religious war here, but I wouldn't call it a simplification per se. NAF, for example, is a choice made in how you describe the world. If you can't prove something, is the negation true? Or if you can't prove something, do you know nothing about it? In standard FOL, the inability to prove p(x) simply means that you can't prove it. In NAF, the inability to prove it means that it is false. Is that a "simplification", or just a different way of looking at the world? Definitely not a subset, though, because the rules are quite simply different.
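A concrete example of the difference (predicates invented):

flies(X) :- bird(X), \+ penguin(X).
bird(tweety).
bird(pingu).
penguin(pingu).

% ?- flies(tweety).
% true, because penguin(tweety) cannot be proved and is therefore
% treated as false. A classical FOL prover would say "unknown" here
% unless told explicitly that tweety is not a penguin.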

Cuts are, IIRC, a programmer's tool, useful for efficient programming of complex recursions, but I don't believe they are, strictly speaking, necessary. To be honest, I haven't used them much.
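The usual textbook example (names hypothetical):

max(X, Y, X) :- X >= Y, !.
max(_, Y, Y).

% The cut commits to the first clause once X >= Y holds; without it,
% backtracking into the second clause could wrongly yield M = 2 for
% ?- max(3, 2, M).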

Silenus said:
Well, Blackburn doesn't seem to like it, IIRC, since for certain English sentences, in the representations he describes, it is just unfortunately inadequate. He suggests using an open-source theorem prover to do this job.

I have to admit that I don't remember all the details of his representation. It's possible that you do need a standard theorem prover, yes. But it's also possible that he was trying to do more with the language than we'd like to here.
10 Feb, 2009, Scandum wrote in the 26th comment:
Votes: 0
amylase said:
Hi guys, thanks a lot for the replies. Put simply, I am working on a linguistic bot. I am trying to improve on Eliza by giving her a sense of theme or topic, rather than just firing off open sentences based on discrete, unrelated sentences. My bot will digest dialogues as they roll out and try to figure out what topic is being talked about.

If you want the mud to revolve around the chat bot, it'd probably be easiest to use SocketMUD. It's a barebones codebase in the public domain, which will leave your options wide open if you ever want to release the source code.
10 Feb, 2009, ShadowsDawn wrote in the 27th comment:
Votes: 0
Scandum said:
If you want the mud to revolve around the chat bot, it'd probably be easiest to use SocketMUD. It's a barebones codebase in the public domain, which will leave your options wide open if you ever want to release the source code.


NakedMUD is built on top of SocketMUD, and is in a much more usable state without sacrificing the options you could have. It is also very actively developed: the developer releases approximately every 6 months, and is frequently in the dev chat channel to answer questions or give you examples of how things could be done with NM.
11 Feb, 2009, amylase wrote in the 28th comment:
Votes: 0
Hi everybody. Thanks so much for the replies. I realise natural language is a big area. Without aiming for anything too ambitious, I might just stick to my original attempt at 'looking for the topic word'. My guess is that with a good dictionary at hand and a not-so-complicated engine, it should be possible for my bot to look up this dictionary however many thousand times and make a pretty good guess at the current topic. That's my hypothesis. I will stick with LPMUD for a bit longer. I am using Dead Souls 2 on FluffOS at the moment and the combination is just great. I'll see how far I can push things, and if a bottleneck occurs I'll change over to something else. Thanks again for all the help and suggestions from everyone.
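Something like this sketch is the sort of thing I mean, written in Prolog just to show the idea (SWI-Prolog's aggregate_all/3; keyword/2 stands in for the dictionary, and all the names are made up):

keyword(weather, rain).
keyword(weather, storm).
keyword(combat, sword).
keyword(combat, fight).

% Count how many words in the utterance hit a given topic.
topic_score(Words, Topic, Score) :-
    aggregate_all(count, (member(W, Words), keyword(Topic, W)), Score).

% Pick the topic with the highest score.
best_topic(Words, Topic) :-
    aggregate_all(max(S, T),
                  (keyword(T, _), topic_score(Words, T, S)),
                  max(_, Topic)).

% ?- best_topic([the, storm, brought, rain], T).
% T = weather.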
11 Feb, 2009, David Haley wrote in the 29th comment:
Votes: 0
I would suggest not dismissing "Eliza-bots" too quickly. You can do an awful lot if you add a little bit of state to Eliza. You might not get the full chain of inferences that you gave in your example, but you can do something fairly passable. Eliza itself has fooled people from time to time, and with just a little bit of contextualization it can be even better.

IMHO you get an awful lot out of slightly nicer conversation compared to no conversation at all. Nobody is expecting full AI here - being able to give the mobs word puzzles and the like. (Well, if they are, it's not a reasonable expectation, I'd think.)
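To give a feel for how little state it takes, here's a Prolog toy (everything here is invented for illustration; a real bot needs many more patterns):

:- dynamic current_topic/1.

% Try a pattern rule first and remember the topic it mentions.
reply(Input, Output) :-
    rule(Input, Topic, Output),
    retractall(current_topic(_)),
    assertz(current_topic(Topic)).
% Otherwise fall back on whatever topic came up last.
reply(_, [tell, me, more, about, Topic]) :-
    current_topic(Topic).

rule([i, like, X], X, [why, do, you, like, X]).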
11 Feb, 2009, Sandi wrote in the 30th comment:
Votes: 0
Forgive me, as I haven't been following this stuff lately, but how does your bot understand the definitions it "looks up"?

I would expect a thesaurus to be more useful than a dictionary.
11 Feb, 2009, Igabod wrote in the 31st comment:
Votes: 0
Keep in mind that with any codebase you're bound to find limitations on one thing or another in what you can do. The goal here is to find ways around those limitations. No matter which codebase you use, you're better off sticking with what you know than trying to learn a whole new codebase. You can venture out and look at all the codebases if you want, but if you've already made a lot of progress on one, why undo all your hard work by switching codebases?