strcasecmp

(PHP 4, PHP 5, PHP 7, PHP 8)

strcasecmp — 大文字小文字を区別しないバイナリセーフな文字列比較を行う

説明

strcasecmp(string $string1, string $string2): int

大文字小文字を区別しないバイナリセーフな文字列比較を行います。ここで行う比較は、ロケールを考慮しません。つまり、ASCII文字だけが、大文字小文字を区別せず比較されます。

パラメータ

string1: 最初の文字列。
string2: 次の文字列。

戻り値

string1 が string2 より小さければ 0 より小さな値を、 string1 が string2 より大きければ 0 より大きな値を、両者が等しければ 0 を返します。返却される値の絶対値に、特に意味はありません。

変更履歴

バージョン	説明
8.2.0	この関数は、2つの文字列の長さが等しくない場合に `strlen($string1) - strlen($string2)` を返すとは限らなくなりました。代わりに、`-1` や `1` を返す可能性があります。

例

例1 strcasecmp() の例

<?php
$var1 = "Hello";
$var2 = "hello";
if (strcasecmp($var1, $var2) == 0) {
 echo '$var1 is equal to $var2 in a case-insensitive string comparison';
}
?>

参考

strcmp() - バイナリセーフな文字列比較
preg_match() - 正規表現によるマッチングを行う
substr_compare() - 指定した位置から指定した長さの 2 つの文字列について、バイナリ対応で比較する
strncasecmp() - バイナリセーフで大文字小文字を区別しない文字列比較を、最初の n 文字について行う
stristr() - 大文字小文字を区別しない strstr
substr() - 文字列の一部分を返す

Found A Problem?

Learn How To Improve This Page • Submit a Pull Request • Report a Bug

+add a note

User Contributed Notes 4 notes

down

chris at cmbuckley dot co dot uk ¶

13 years ago

A simple multibyte-safe case-insensitive string comparison:

<?php

function mb_strcasecmp($str1, $str2, $encoding = null) {
 if (null === $encoding) { $encoding = mb_internal_encoding(); }
 return strcmp(mb_strtoupper($str1, $encoding), mb_strtoupper($str2, $encoding));
}

?>

Caveat: watch out for edge cases like "ß".

down

chrislarham at NOSPAM dot outlook dot com ¶

7 years ago

I didn't see any explanation in the documentation as to precisely how the positive/negative return values are calculated for unequal strings.

After a bit of experimentation it appears that it's the difference in alphabetical position of the first character in unequal strings.

For example, the letter 'z' is the 26th letter while the letter 'a' is the 1st letter:

<?php

 $zappl = "zappl";
 $apple = "apple";

 echo strcasecmp($zappl, $apple); #outputs 25 [26 - 1]
 echo strcasecmp($apple, $zappl); #outputs -25 [1 - 26]

?>

This might be incredibly obvious to most people, but hopefully it will clarify the calculation process for some others.

down

Anonymous ¶

23 years ago

The sample above is only true on some platforms that only use a simple 'C' locale, where individual bytes are considered as complete characters that are converted to lowercase before being differentiated.

Other locales (see LC_COLLATE and LC_ALL) use the difference of collation order of characters, where characters may be groups of bytes taken from the input strings, or simply return -1, 0, or 1 as the collation order is not simply defined by comparing individual characters but by more complex rules.

Don't base your code on a specific non null value returned by strcmp() or strcasecmp(): it is not portable. Just consider the sign of the result and be sure to use the correct locale!

down

alvaro at demogracia dot com ¶

15 years ago

Don't forget this is a single-byte function: in Unicode strings it'll provide incoherent results as soon as both strings differ only in case. There doesn't seem to exist a built-in multi-byte alternative so you need to write your own, taking into account both character encoding and collation.

+add a note