Word count — the classic introductory Hadoop program — implemented with plain Java 8 streams.
Example 1: counting word occurrences in a txt file.
@Test
public void fileWordCount() throws IOException {
    // Counts word occurrences in a plain-text file and prints each
    // "word=count" pair. Non-text formats must be converted to .txt first.
    Files.readAllLines(Paths.get("D:\\jd.txt"), StandardCharsets.UTF_8).stream()
            // Flatten the stream of lines into a stream of words.
            // "\\s+" (was "\\s") collapses runs of whitespace so that
            // consecutive spaces/tabs don't produce empty tokens.
            .flatMap(line -> Arrays.stream(line.trim().split("\\s+")))
            // An entirely blank line still splits to a single "" token.
            .filter(word -> !word.isEmpty())
            // Group identical words and tally them. counting() only needs the
            // key, so the SimpleEntry(word, 1) wrapper the original built was
            // redundant. Sequential stream: the input is tiny and I/O-bound,
            // so parallelStream() buys nothing here.
            .collect(groupingBy(word -> word, counting()))
            .entrySet()
            .forEach(System.out::println);
}
Example 2: counting word occurrences in a List.
@Test
public void listWordCount() {
    // Counts occurrences of each string in a list two ways: first with the
    // built-in counting() collector, then with an explicit reduction.
    List<String> stringList = Arrays.asList("a", "b", "c", "a");

    // Way 1: groupingBy + counting(). counting() only needs the grouping key,
    // so no SimpleEntry(s, 1) wrapper is required.
    stringList.stream()
            .collect(groupingBy(s -> s, counting()))
            .entrySet()
            .stream()
            .forEach(System.out::println);
    System.out.println("---------------------------------------------------");

    // Way 2: an explicit reduce — counting() is itself implemented as a
    // reduction. The functional interfaces are declared explicitly because a
    // bare lambda in the middle of the pipeline gives the compiler too little
    // context to infer the mapped type.
    BinaryOperator<Integer> countMerger = Integer::sum;
    // Sort key extractor for the printed output. Parameterized entry type —
    // the original used a raw Map.Entry, which disables type checking and
    // forced an Integer -> String -> Integer round-trip; with generics the
    // value is already an Integer and unboxes directly.
    ToIntFunction<Map.Entry<String, Integer>> byCount = entry -> entry.getValue();

    stringList.stream()
            // Here the SimpleEntry wrapper IS needed: it feeds reducing()'s
            // value-extractor (SimpleEntry::getValue).
            .map(s -> new AbstractMap.SimpleEntry<>(s, 1))
            .collect(groupingBy(AbstractMap.SimpleEntry::getKey,
                    reducing(0, AbstractMap.SimpleEntry::getValue, countMerger)))
            .entrySet()
            .stream()
            // Print in ascending order of count.
            .sorted(Comparator.comparingInt(byCount))
            .forEach(System.out::println);
}