(PHP 4, PHP 5, PHP 7, PHP 8)
strcasecmp — 大文字小文字を区別しないバイナリセーフな文字列比較を行う
大文字小文字を区別しないバイナリセーフな文字列比較を行います。 ここで行う比較は、ロケールを考慮しません。 つまり、ASCII文字だけが、大文字小文字を区別せず比較されます。
string1最初の文字列。
string2次の文字列。
string1 が string2 より小さければ 0 より小さな値を、
string1 が string2 より大きければ 0 より大きな値を、
両者が等しければ 0 を返します。
返却される値の絶対値に、特に意味はありません。
| バージョン | 説明 |
|---|---|
| 8.2.0 |
この関数は、2つの文字列の長さが等しくない場合に
strlen($string1) - strlen($string2) を返すとは限らなくなりました。
代わりに、-1 や 1 を返す可能性があります。
|
例1 strcasecmp() の例
<?php
$var1 = "Hello";
$var2 = "hello";
if (strcasecmp($var1, $var2) == 0) {
echo '$var1 is equal to $var2 in a case-insensitive string comparison';
}
?>
A simple multibyte-safe case-insensitive string comparison:
<?php
function mb_strcasecmp($str1, $str2, $encoding = null) {
if (null === $encoding) { $encoding = mb_internal_encoding(); }
return strcmp(mb_strtoupper($str1, $encoding), mb_strtoupper($str2, $encoding));
}
?>
Caveat: watch out for edge cases like "ß".I didn't see any explanation in the documentation as to precisely how the positive/negative return values are calculated for unequal strings.
After a bit of experimentation it appears that it's the difference in alphabetical position of the first character in unequal strings.
For example, the letter 'z' is the 26th letter while the letter 'a' is the 1st letter:
<?php
$zappl = "zappl";
$apple = "apple";
echo strcasecmp($zappl, $apple); #outputs 25 [26 - 1]
echo strcasecmp($apple, $zappl); #outputs -25 [1 - 26]
?>
This might be incredibly obvious to most people, but hopefully it will clarify the calculation process for some others.The sample above is only true on some platforms that only use a simple 'C' locale, where individual bytes are considered as complete characters that are converted to lowercase before being differentiated.
Other locales (see LC_COLLATE and LC_ALL) use the difference of collation order of characters, where characters may be groups of bytes taken from the input strings, or simply return -1, 0, or 1 as the collation order is not simply defined by comparing individual characters but by more complex rules.
Don't base your code on a specific non null value returned by strcmp() or strcasecmp(): it is not portable. Just consider the sign of the result and be sure to use the correct locale!Don't forget this is a single-byte function: in Unicode strings it'll provide incoherent results as soon as both strings differ only in case. There doesn't seem to exist a built-in multi-byte alternative so you need to write your own, taking into account both character encoding and collation.