[Reasking, due to poor response. Doesn't anyone care about this? It shouldn't be too hard to get right.]
Searching for symbols within Yahoo!Answers is inconsistent,
here is what you get if you search on various symbols
(some are Alt-codes and some are HTML characters):
Most alt-codes e.g. integral (∫), square-root(√), infinity (∞), (≈) => these all match the word 'and' - useless!?
squared (²) => also matches '2', which makes it useless
Greek letters:
capital Sigma (Σ) => returns both Σ and lower σ , which is meaningless in science (instead of if you were searching for Greek-language, in which case it would be correct).
Similarly θ => θ,Θ
α,β,λ,µ...
Anyone find anything else inconsistent, weird or quirky with any other symbols?
Maybe if we kick up a fuss they might fix this... please contribute...
[Note: if you post an URL in reply, put it in the 'Sources' box]
2007-06-25
21:27:59
·
3 answers
·
asked by
smci
7
in
Science & Mathematics
➔ Mathematics
Northstar - interesting comments
Kaksi - "if we kicked up a fuss it wouldn't help; they cannot fix it!"
Yes they can index these things, they only need to change the encoding used in internal character representation. At the moment, it doesn't even do the ANSI 256-character set, thus my comment on '√'.
Indexing the symbols to the word 'and' is plain silly.
Let alone other encodings (e.g. foreign languages).
"you are lucky to find Greek Σ and σ" because your national MS Office is Greek"
Nonsense - it isn't and no I'm not Greek!
Σ is simply Alt-Numeric-228 ...
http://www.usefulshortcuts.com/alt-codes/maths-alt-codes.asp
... and σ can be copied from the full
"HTML 4.0 Character Set"
http://www.alanwood.net/demos/ent4_frame.html
In maths & science, 'Σ' means summation, whereas 'σ' means standard deviation (or conductivity, or whatever else) - so the two are totally unrelated, in this context. That is not massively complicated to implement.
2007-06-25
23:49:52 ·
update #1
kaksi - Yes, they could internally represent with Unicode which is a standardized and platform-independent
representation whereas Alt-codes are not. I was simply showing you how to input the range of scientific symbols, to search for. I also gave a link to a full list of HTML encodings.
YA could very easily index with Unicode, and the 2-5 byte length can be compressed away (e.g. Huffman encoding). This is what I'm suggesting, and it's very doable.
PS You're the first person ever to call my avatar Greek, why?
2007-06-27
09:53:05 ·
update #2