利用电脑玩Android版“天天连萌”刷高分（二）——图像识别

貌似掉线

浏览: 256376 次
性别:
来自: 深圳

最近访客更多访客>>

foquanlin

xx5333

jiangningqu70

flynet

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

Android笔记
JAVA笔记

图像识别汉明距离算法天天连萌图片hash

上一篇只是提到了在PC端利用android sdk里面的工具进行截图，接下来这一篇将补充一点关于上一篇的内容，然后介绍一下程序的整个结构，以及如何进行《天天连萌》里面的图像识别和消除的搜索算法。

一、补充上篇的内容
首先补充一下上一篇忘了提及的内容。
在使用chimpchat时，需要添加几个jar包。这方面网上的资料很少，不过功夫不负有心人，嘿嘿。
需要添加的jar包如下：

chimpchat.jar
common.jar
ddmlib.jar
guava-13.0.1.jar

以上jar包都可以在android sdk里面的tools/lib目录中找到。

二、程序的设计
在这个程序里面我主要写了4个java文件：

Main.java 只有一个main方法，程序的入口。
Robot.java，程序的核心部分，进行游戏截图、图像转换为数组，搜索消除等。里面包含一个LianlianKan的内部类，它是用于搜索可以消除的方块的工具类。
Point.java，表示在数组中的坐标位置的对象。
ImageHash.java，图像识别的算法类，采用汉明距离算法进行图片相似检测。

三、图像识别及转换。
首先先截一张游戏界面的图。游戏里的方块是分布在中心的一个10*5（方块大小）的区域中的。所以先截下图，通过工具取得它的4个边的边距。以我的手机为例，它是800*480的分辨率的，截的图是竖屏的，左边距为48，右边距为72，上边距及下边距为115。每个游戏方块为57*72。如下图所示：

所以在Robot.java需要定义以上相关常量，代码如下：

/**
 * 主要操作类，进行截图，识别，转换，消除，按键等。
 * 
 * @author Geek_Soledad <a target="_blank" href=
 *         "http://mail.qq.com/cgi-bin/qm_share?t=qm_mailme&email=XTAuOSVzPDM5LzI0OR0sLHM_MjA"
 *         style="text-decoration:none;"><img src=
 *         "http://rescdn.qqmail.com/zh_CN/htmledition/images/function/qm_open/ico_mailme_01.png"
 *         /></a>
 */
public class Robot {
	/**
	 * 屏幕宽，视手机而修改
	 */
	private static final int SCREEN_WIDTH = 480;
	/**
	 * 屏幕高，视手机而修改
	 */
	private static final int SCREEN_HEIGHT = 800;

	/**
	 * 左边距，视手机而修改
	 */
	private static final int PADDING_LEFT = 48;
	/**
	 * 右边距，视手机而修改
	 */
	private static final int PADDING_RIGHT = 72;
	/**
	 * 上边距，视手机而修改
	 */
	private static final int PADDING_TOP = 115;
	/**
	 * 下边距，视手机而修改
	 */
	private static final int PADDING_BOTTOM = 115;
	/**
	 * 游戏方块列数
	 */
	private static final int BOX_COL = 5;
	/**
	 * 游戏方块行数
	 */
	private static final int BOX_ROW = 10;
	/**
	 * 图片宽
	 */
	private static final int IMAGE_WIDTH = (SCREEN_WIDTH - PADDING_LEFT - PADDING_RIGHT) / BOX_COL;
	/**
	 * 图片高
	 */
	private static final int IMAGE_HEIGHT = (SCREEN_HEIGHT - PADDING_TOP - PADDING_BOTTOM)
			/ BOX_ROW;
	/**
	 * 截除的边角宽，视手机而修改
	 */
	private static final int CORNER_WIDTH = 24;
	/**
	 * 截除的边角高，视手机而修改
	 */
	private static final int CORNER_HEIGHT = 27;

	/**
	 * 数组行数
	 */
	private static final int CODE_ROW = 12;
	/**
	 * 数组列数
	 */
	private static final int CODE_COL = 7;

	// ...
}

上面定义的常量当中还有CORNER_WIDTH及CORNER_HEIGHT，这是因为有时有些方块会有道具标示，或者是是“*2”的分数提示，所以截取小图进行图像识别的时候还要避开这一点。然后上面提到的数组行列数，这里的数组不是取得的图像的数组。而是为了便于比较计算，将每个图像对应一个int数字，bomb或为空时为0。它的行及列分别为12和7，而不是10和5，是因为考虑到连连看的规则而加上的外围边界。如下图所示：

然后还要定义两个成员变量，一个表示取得的方块矩阵，一个表示对应的数字矩阵，代码如下：

	private BufferedImage images[][] = new BufferedImage[BOX_ROW][BOX_COL];
	/**
	 * 表示图片的数组，为12 * 7个。 图片共有10*5个单位，但是在进行路径计算的时候还要考虑四周，所以是12 * 7 个单位。
	 */
	private int imageCodes[][];

从截屏取得的大图中获取方块小图代码如下：

		for (int i = 0; i < images.length; i++) {
			for (int j = 0; j < images[i].length; j++) {
				images[i][j] = image.getSubimage(j * IMAGE_WIDTH + PADDING_LEFT + 3, i
						* IMAGE_HEIGHT + PADDING_TOP + 3, IMAGE_WIDTH - CORNER_WIDTH - 3,
						IMAGE_HEIGHT - CORNER_HEIGHT - 3);
			}
		}

上面代码中的+3及-3，是为了不计算方块的边界，可看情况修改，因为当游戏中出现提示时，方块的边框是有流动动画的，它如果被计算在内的话也会影响图像识别。
取得图之后，我们可以通过计算每个图像对应的hash值，然后对每种图像定义一个值，这样在游戏开始后，通过汉明距离算法就可以把图像转换为一个二维数组，进行连连看的方块消除搜索。当然，采用这一算法，需要先截得包含这些方块图像的游戏界面，然后计算出它们各自的hash值，进行存储。除去bomb，共有28种方块图像，多进行几次游戏就可以全截取到了。关于图片转换为hash值及进行相似判断的代码将在后面给出。
下面贴上将方块图片转换为int二维数组的代码，其中image是截取的屏幕的大图，然后取得小图边计算它的hash值，再调用distance判断是哪个图像方块：

	/**
	 * 通过获取的截图设置num数组
	 */
	public void setNum(BufferedImage image) {
		imageCodes = new int[CODE_ROW][CODE_COL];
		for (int i = 0; i < images.length; i++) {
			for (int j = 0; j < images[i].length; j++) {
				images[i][j] = image.getSubimage(j * IMAGE_WIDTH + PADDING_LEFT + 3, i
						* IMAGE_HEIGHT + PADDING_TOP + 3, IMAGE_WIDTH - CORNER_WIDTH - 3,
						IMAGE_HEIGHT - CORNER_HEIGHT - 3);
				String hash = mImgHash.getHash(images[i][j]);
				int minDis = Integer.MAX_VALUE;
				for (int k = 0; k < GAME_IMAGE.length; k++) {
					int dis = mImgHash.distance(GAME_IMAGE[k], hash);
					if (dis <= 8 && dis < minDis) {
						imageCodes[i + 1][j + 1] = k + 1;
						minDis = dis;
						if (minDis <= 0) {
							break;
						}
					}
				}
				// System.out.print(imageCodes[i + 1][j + 1] + "\t");
			}
			// System.out.println();
		}
	}

上面的mImgHash是一个ImageHash对象，也是我定义的实现汉明距离算法的类。GAME_IMAGE存储了每个方块图像对应的hash值，是之前计算得到的，定义如下：

public class Robot {
	/**
	 * 表示每个方块图像的HASH值
	 */
	private static final String[] GAME_IMAGE = {
			"0110000100110010101000110111110000010010101001110"/* 煎蛋 */,
			"0000001100000000011110100101100110111100000110000"/* 紫猫 */,
			"0010000101010101010100101110100000110101001101111"/* 白菜 */,
			"0000001001101001100011100110101110010100000010011"/* 茄子 */,
			"1001100100000101001100100111001100101110110100010"/* 兔子 */,
			"1000010001000101111010100100100011010010111011000"/* 莲藕 */,
			"0010010010100100101100110101100011011010010010001"/* 红虾 */,
			"1000000000101000100010111000110000011001111011100"/* 玉米 */,
			"0001100001100101101001010100111001101010110101100"/* 闪电 */,
			"0000000000100001010100011111101010010000000100000"/* 狐狸 */,
			"1100000101000000011110111011010011011100001100000"/* 白云 */,
			"1000011000110101100000110100010001110011001100000"/* 菠萝 */,
			"1000111001100101101010110100100001110110101011000"/* 草莓 */,
			"0000000001110011100001000011001110111001001100110"/* 蘑菇 */,
			"1111011000110100111001110100000101011000011100111"/* 蓝鼠 */,
			"1000000000001000111100110000110011110000011011101"/* 太阳 */,
			"1001100000001101100010111000110001010001110011100"/* 月亮 */,
			"1011100000001110100100101100010100101001011011010"/* 雪人 */,
			"1000011001101101011000111100101001111100011001100"/* 熊猫 */,
			"1000000000000001010100001110101010111000010100000"/* 黄熊 */,
			"1000000110100011001110111001100110001100001001011"/* 彩虹 */,
			"1010100000001001001000101001010111011000111001000"/* 雪花 */,
			"1000110001110001100010100000100111010100010010000"/* 西瓜 */,
			"1000001101011000011110110001010011001001010110110"/* 香蕉 */,
			"0001100000001101000001000000001100001001110100000"/* 蓝果实 */,
			"1000100011101111010001000110001000000010010100110"/* 葡萄 */,
			"0000100000011100110000101010100000111000101000111"/* 红果实 */,
			"1001100000110001000110110011001110001011000110011"/* 黄梨 */, };
	// ...
}

在遍历GAME_IMAGE数组进行每个方块的相似识别时，通常距离小于5的都可以认为是相似图片，在这里会比较与哪一个图片的hash距离最小是因为取得的方块图片都较小，为避免识别错误需要对每个图片都进行距离判断（因为一开始我取的识别精度并不是5，而是8，极少数情况下会有图片识别错误）。

最后附上汉明距离算法的代码：

 * @(#)ImageHash.java	       Project:lianmeng
 * Date-Time:2013-10-11 下午7:40:20
 *
 * Copyright (c) 2013 CFuture09, Institute of Software, 
 * Guangdong Ocean University, Zhanjiang, GuangDong, China.
 * All rights reserved.
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 *  you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *     http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */
package pw.msdx.lianmengassistant;

import java.awt.Graphics2D;
import java.awt.color.ColorSpace;
import java.awt.image.BufferedImage;
import java.awt.image.ColorConvertOp;

/*
 * pHash-like image hash. 
 * Author: Elliot Shepherd (elliot@jarofworms.com
 * Based On: http://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html
 * 
 * Optimize by Geek_Soledad.
 */
public class ImageHash {

	private int size = 8;
	private int smallerSize = 8;

	public ImageHash() {
		initCoefficients();
	}

	public ImageHash(int size, int smallerSize) {
		this.size = size;
		this.smallerSize = smallerSize;

		initCoefficients();
	}

	public int distance(String s1, String s2) {
		int counter = 0;
		for (int k = 0; k < s1.length(); k++) {
			if (s1.charAt(k) != s2.charAt(k)) {
				counter++;
			}
		}
		return counter;
	}

	// Returns a 'binary string' (like. 001010111011100010) which is easy to do
	// a hamming distance on.
	public String getHash(BufferedImage img) {
		/*
		 * 1. Reduce size. Like Average Hash, pHash starts with a small image.
		 * However, the image is larger than 8x8; 32x32 is a good size. This is
		 * really done to simplify the DCT computation and not because it is
		 * needed to reduce the high frequencies.
		 */
		img = resize(img, size, size);

		/*
		 * 2. Reduce color. The image is reduced to a grayscale just to further
		 * simplify the number of computations.
		 */
		img = grayscale(img);

		double[][] vals = new double[size][size];

		for (int x = 0; x < img.getWidth(); x++) {
			for (int y = 0; y < img.getHeight(); y++) {
				vals[x][y] = getBlue(img, x, y);
			}
		}

		/*
		 * 3. Compute the DCT. The DCT separates the image into a collection of
		 * frequencies and scalars. While JPEG uses an 8x8 DCT, this algorithm
		 * uses a 32x32 DCT.
		 */
		double[][] dctVals = applyDCT(vals);

		/*
		 * 4. Reduce the DCT. This is the magic step. While the DCT is 32x32,
		 * just keep the top-left 8x8. Those represent the lowest frequencies in
		 * the picture.
		 */
		/*
		 * 5. Compute the average value. Like the Average Hash, compute the mean
		 * DCT value (using only the 8x8 DCT low-frequency values and excluding
		 * the first term since the DC coefficient can be significantly
		 * different from the other values and will throw off the average).
		 */
		double total = 0;

		for (int x = 0; x < smallerSize; x++) {
			for (int y = 0; y < smallerSize; y++) {
				total += dctVals[x][y];
			}
		}
		total -= dctVals[0][0];

		double avg = total / (double) ((smallerSize * smallerSize) - 1);

		/*
		 * 6. Further reduce the DCT. This is the magic step. Set the 64 hash
		 * bits to 0 or 1 depending on whether each of the 64 DCT values is
		 * above or below the average value. The result doesn't tell us the
		 * actual low frequencies; it just tells us the very-rough relative
		 * scale of the frequencies to the mean. The result will not vary as
		 * long as the overall structure of the image remains the same; this can
		 * survive gamma and color histogram adjustments without a problem.
		 */
		StringBuilder hash = new StringBuilder();

		for (int x = 0; x < smallerSize; x++) {
			for (int y = 0; y < smallerSize; y++) {
				if (x != 0 && y != 0) {
					hash.append(dctVals[x][y] > avg ? "1" : "0");
				}
			}
		}
		return hash.toString();
	}

	private BufferedImage resize(BufferedImage image, int width, int height) {
		BufferedImage resizedImage = new BufferedImage(width, height, BufferedImage.TYPE_INT_ARGB);
		Graphics2D g = resizedImage.createGraphics();
		g.drawImage(image, 0, 0, width, height, null);
		g.dispose();
		return resizedImage;
	}

	private ColorConvertOp colorConvert = new ColorConvertOp(
			ColorSpace.getInstance(ColorSpace.CS_GRAY), null);

	private BufferedImage grayscale(BufferedImage img) {
		colorConvert.filter(img, img);
		return img;
	}

	private static int getBlue(BufferedImage img, int x, int y) {
		return (img.getRGB(x, y)) & 0xff;
	}

	// DCT function stolen from
	// http://stackoverflow.com/questions/4240490/problems-with-dct-and-idct-algorithm-in-java

	private double[] c;

	private void initCoefficients() {
		c = new double[size];

		for (int i = 1; i < size; i++) {
			c[i] = 1;
		}
		c[0] = 1 / Math.sqrt(2.0);
	}

	private double[][] applyDCT(double[][] f) {
		int N = size;
		double[][] F = new double[N][N];
		for (int u = 0; u < N; u++) {
			for (int v = 0; v < N; v++) {
				double sum = 0.0;
				for (int i = 0; i < N; i++) {
					for (int j = 0; j < N; j++) {
						sum += Math.cos(((2 * i + 1) / (2.0 * N)) * u * Math.PI)
								* Math.cos(((2 * j + 1) / (2.0 * N)) * v * Math.PI) * (f[i][j]);
					}
				}
				sum *= ((c[u] * c[v]) / 4.0);
				F[u][v] = sum;
			}
		}
		return F;
	}

}

查看图片附件

分享到：

利用电脑玩Android版“天天连萌”刷高分（ ... | 利用电脑玩Android版“天天连萌”刷高分（ ...

2013-10-20 15:44
浏览 1480
评论(0)
分类:移动开发
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论